WILEY PUBLICATIONS IN PSYCHOLOGY

FOUNDATIONS OF PSYCHOLOGY
By Edwin G. Boring, Herbert S. Langfeld, and Harry P. Weld

INTRODUCTION TO PSYCHOLOGY
By Edwin G. Boring, Herbert S. Langfeld, and Harry P. Weld

SOCIAL PSYCHOLOGY
By Daniel Katz and Richard L. Schanck

HEARING: ITS PSYCHOLOGY AND PHYSIOLOGY
By S. Smith Stevens and Hallowell Davis

MANUAL OF PSYCHIATRY AND MENTAL HYGIENE, Seventh Edition
By Aaron J. Rosanoff

STATISTICAL METHODS IN BIOLOGY, MEDICINE, AND PSYCHOLOGY
By C. B. Davenport and Merle P. Ekas

MOTIVATION OF BEHAVIOR
By P. T. Young

PSYCHOLOGY: A FACTUAL TEXTBOOK
By Edwin G. Boring, Herbert S. Langfeld, and Harry P. Weld

PSYCHOLOGY IN BUSINESS AND INDUSTRY
By John G. Jenkins


HERBERT S. LANGFELD
Advisory Editor

METHODS OF PSYCHOLOGY
T. G. Andrews, Editor

THE PSYCHOLOGY OF EGO-INVOLVEMENTS
By Muzafer Sherif and Hadley Cantril

MANUAL OF CHILD PSYCHOLOGY
Leonard Carmichael, Editor

EMOTION IN MAN AND ANIMAL
By P. T. Young

UNCONSCIOUSNESS
By James Grier Miller

THE PSYCHOLOGY OF PERSONAL ADJUSTMENT
By Fred McKinney

THE PSYCHOLOGY OF SOCIAL MOVEMENTS
By Hadley Cantril


HEARING 


Its Psychology and Physiology 


BY 

STANLEY SMITH STEVENS, PH.D.

Department of Psychology
Harvard University

AND

HALLOWELL DAVIS, M.D.

Director of Research
Central Institute for the Deaf
St. Louis, Mo.


JOHN WILEY & SONS, INC. 
New York London 



PERSPECTIVE 


In 1863 Helmholtz wrote his Lehre von den Tonempfindungen,
bringing together the scattered facts about hearing and defining
a field of scientific interest. There were then hardly more
than a dozen important facts in this field, and some were not
very old. It is true that the mathematics of music, the facts
of intervals and their ratios, had concerned mathematicians
ever since Pythagoras, but the velocity of sound had not been
measured until 1636, and it was not until 1660 that Boyle proved
conclusively that a bell ringing in vacuo is inaudible. Galileo
in 1638 worked out the chief laws of physical resonance and
established firmly the principle that pitch is a function of
frequency, a principle that stood until 1930. The tuning fork
dates from 1714, and that year is also the date of Tartini's first
mention of the difference tones, which were for a century called
by his name. In 1830 Savart set the upper and lower thresholds
of hearing at 24,000 and 8 vibrations per second, holding a card
against the teeth of a rotating wheel in order to determine
these limits. In 1843 Ohm formulated his acoustic law that the
ear hears the harmonic components of a complex wave form,
an analytic principle that gained universal meaning in the light
of Fourier's theorem of 1822 for the analysis of any periodic
wave into sinusoidal components. That, however, was altogether
very little. There was not much known about hearing
when Helmholtz wrote the Tonempfindungen.

Helmholtz himself, besides bringing the physical and physiological
facts together, demonstrated the variety of combination
tones and determined the perceptual character of beats.
He established the primary nature of the vowels, and he proposed
a theory of them with an experimental demonstration for
the theory. His keen observation and his fertile imagination
give him a position on many topics, and again and again he
has turned out to be right. It is the resonance theory of hearing,
however, that is his classic contribution in the field of auditory
sensation.

In spite of all the discussion that went on, really not so very
much happened in the sixty years after Helmholtz. The problem
of the nature of vowels has recurred constantly from
Donders' experiments in 1870 and Hermann's theory of formants
in 1895 until the present day. The auditory thresholds
were determined by Preyer in 1876 and then by Luft in 1888.
Stumpf, on the basis of very slight evidence, published his laws
of tonal fusion in 1890, and they persisted in uncritical
acceptance for at least thirty years. The chief facts of auditory
localization were established in 1901 (intensity), 1908 (phase),
and 1920 (time). There was a flurry in tonal attributes about
1913-1919. And all this while scientists made up theories of
hearing. The frequency theory or telephone theory was first
posed against Helmholtz's resonance theory. Helmholtz's
theory was a place theory, in that it derived its cogency from
Johannes Müller's doctrine of the specific energies of nerves:
for a different pitch there must be always a different nerve fiber.
Presently there came to be resonance theories that correlated
pitch with the fiber excited and others that correlated pitch with
the frequency of excitation, and there were also both place
theories and frequency theories that rejected the notion of
analysis by resonance. The establishment in 1912 of the
all-or-none law of nervous conduction should have made trouble for
all the theorists who naively assumed that loudness depends
upon the intensity of the neural impulse, but it was really a
dozen years before this difficulty was generally appreciated. In
other words, there was in this field more effort than success.
What was needed was the discovery of a new approach to the
old problem.

That approach was made possible by the development of
the electronic vacuum tube for the amplification of small
potentials. Although the history of electrical amplification goes
back to de Forest in 1907, it was the later commercialization of
the radio that placed these techniques in the hands of
physiologists and psychologists. In 1929 Wever and Bray applied the
new technique to the amplification of physiological potentials
of the auditory mechanism and found that these potentials
resemble the acoustic stimulus. Later research showed that it
is only the 'cochlear response' that nearly duplicates the stimulus,
and that the action potentials of the nerve are limited, as might
have been expected, by the all-or-none law and the normal
refractory periods. It was, nevertheless, the discovery of this
'Wever-Bray effect' that began the wave of investigation into the
physiology of hearing that has been spreading since 1930.
Aided by the physicists, the psychophysics of threshold values
and of sensory equivalents has kept pace with this new
experimental physiology. Thus the reader of this book can see how
much of the psychology and physiology of hearing has been
learned within the last decade and also how little of it was the
product of the preceding half century. In this way he will
be able to gauge for himself the potency of a discovery of a new
technique. Certainly we are ready now for a new Lehre von
den Tonempfindungen to orient us among the complexities of
the new physiological acoustics which is now so successfully
answering questions which Helmholtz posed.


Edwin G. Boring



PREFACE

Here is our book. It was originally conceived in the high
hope of doing justice to a subject of inquiry which, within a
decade, has undergone impressive transformation. Now, with
the finished manuscript before us on the table, we are aware
that the field of audition is already on the point of expanding
beyond the confines of a single volume. And we find that our
original purpose has had to accede to the practical demands of
space. The relations of the science of audition to architecture
and applied acoustics, to speech and phonetics, to the problem
of noise, and to music are some of our deliberate omissions.

We undertook the preparation of this volume for two reasons.
We wanted to provide the students of psychology, physiology,
acoustics, and otology with an inventory of the recent
discoveries in the psychophysiology of hearing, discoveries
which up to now have enjoyed relative seclusion in scientific
periodicals; and we wanted to test the progress of the study of
audition by casting up the balance in systematic form, taking
stock of the gaps and deficiencies, and finding to what extent
auditory research is able to yield a consistent point of view.
The value of this book as an aid to the student of hearing
might have been improved by a different order of the chapters,
although the present order was dictated by a desire to achieve
a logical development of the subject matter, and not merely by
the fact that a psychologist was responsible for most of the first
half and a physiologist for most of the second half of the work.
(Incidentally, both psychologist and physiologist did much
rewriting of both halves.) The logic of the presentation is first
to provide the student with the fundamentals of the science of
sound (with a minimum of mathematics) and then to tell
him what he hears when a sound reaches his ears, and what are
the systematic relations between stimulus and sensation. Then,
knowing what he hears, he is in a position to be told, beginning
with Chapter 10, how he hears it. The functional anatomy and
physiology of the ear, therefore, is the subject matter of the later
chapters. The reader who favors a different order can just as
well read the last nine chapters immediately after Chapter 1.
Numerous cross-references have been included to aid him.

A glossary of essential terms has been developed in the hope
that it will prove useful as a convenient source of precise
definitions, and two appendices have been added to provide
convenient reference to some mathematical developments which
may be beyond the interests of the general reader. A third
appendix contains what we have found to be a very useful table
for converting ratios of sound pressure or voltage into decibels.
We have tried throughout to present a systematic and
consistent picture of the auditory process, but we find we have had
surprisingly little to say about theories of hearing. This omission
is probably not so much symptomatic of a personal lack of
interest in 'theories' as it is indicative of a state of development
in the science. Theories flourish on a certain sparseness of facts
and wither in the face of abundance. When all the relations
are known, alternative theories are no longer possible, and, if
a present inventory of the facts of audition leaves little room
for theories of hearing, in the nineteenth-century meaning of
the phrase, that situation must be accounted a sign of progress.
Nevertheless, plenty of opportunity remains for the theorist in
the possible interpretation of many individual items, and we
have indulged in our share of speculation. Our interpretations
now appear to us to be consistent with a place theory that does
not employ the principle of simple resonance. We did not
begin with this type of theory in mind. The fact that a systematic
survey of the field has altered our point of view seems
to us to indicate that we were justified in our second reason for
undertaking this task: the discovery of the extent to which the
field of audition is able to yield a consistent point of view.
One more confession must be made, lest the reader look for
what he will not find. Many of the topics of these eighteen
chapters have histories which go back far beyond the last
decade. To trace these histories adequately, however, would
demand many more pages of text than good faith would allow
us to impose upon the reader. Furthermore, almost all the
early psychophysical measurements have recently been repeated
under the more favorable auspices of modern electrical
techniques. Consequently, out of our list of references, consisting
of about 330 titles, more than 280 bear a date more recent than
the paper by Forbes, Miller, and O'Connor, who, just ten years
ago, first described synchronized nerve impulses in the auditory
pathways of the brain. Although we have made no attempt to
assemble a complete bibliography, even of recent papers, the
appended list of references should provide adequate leads for
the student who wishes to pursue a topic to its roots.

Finally, in assembling this review, we have been impressed
by the variety of sources from which the facts of hearing are
derived. It is characteristic of the science of audition that it
ignores the traditional boundaries between the sciences. None
of the traditional disciplines nor any of the academic
departments of the modern university can claim audition exclusively
as its own. The mystery of the ear inspires the psychologist,
the physiologist, the otologist, and the physicist alike. Hence,
although much of the work recorded in these chapters has been
carried out at Harvard University, it is significant that no single
laboratory is exclusively responsible for it. There has been
active collaboration between the authors, representing psychology
and physiology, and members of the departments of otology
and physics. Elsewhere the situation is similar. From the
Laboratory of Psychology at Princeton University, from the
Bell Telephone Laboratories, from the Animal Hearing Laboratory
at the University of Illinois, from the Department of Physics
in the University of Michigan, from the Department of Psychology
at the State University of Iowa, from the Otological Research
Laboratory at Johns Hopkins University, from the
Government Laboratories of Hungary, from the Telefunken
Laboratories of Berlin, from the Laboratories of Physiology and
of Psychology in the University of Cambridge, and from many
other active laboratories, comes an impressive stream of new
discoveries, a fact which means that this book is a current
inventory and not a final summary.




The preparation of any manuscript is an arduous task, and
authors universally feel indebted beyond expression to those
whose helpfulness makes the task bearable. Dr. M. H. Lurie of
the Department of Otology has not only collaborated in many
of the experiments here recorded, but has kindly provided
several photomicrographs of the inner ear. Professor E. G. Boring
read a large part of the manuscript and saved it from many
faults. Professor F. V. Hunt contributed valuable suggestions
for the improvement of Chapter 1. Mr. A. H. Bernstone
devoted much time and talent to the drawing of many of the
illustrations. He also read the entire manuscript and
contributed to its improvement. Dr. A. F. Rawdon-Smith read
the proof and suggested valuable modifications. Mr. Frank
O'Neill's skill as a photographer has aided in the reproduction
of many of the figures. Mrs. F. C. Leighton turned our battered
first drafts into good copy for the printer, and her skill has been
an invaluable asset. To all these friends we are grateful.

S. Smith Stevens
Hallowell Davis

Harvard University
December 28, 1937



ACKNOWLEDGMENTS 


It is a pleasure to acknowledge the helpful and generous cooperation of
authors, editors, and publishers in allowing us to use, either in their original
form or with modifications, many of the figures which illustrate this book.
We extend our thanks to the authors and publishers cited in the legends.
The original sources of publication will be found in the bibliography under
the authors' names. We are also specifically indebted to editors and publishers
as follows:

Akademische Verlagsgesellschaft M. B. H.: Figs. 64, 66, 67
American Academy of Ophthalmology and Otolaryngology: Fig. 113
American Institute of Electrical Engineers: Figs. 70, 71, and 75
American Journal of Physiology: Figs. 110, 128, 130, 131, 147, 148, 149, 150,
153, 156, 157, 158, 163, 164, 165
American Journal of Psychology: Figs. 68, 74
American Laryngological, Rhinological and Otological Society: Fig. 115
American Medical Association Press: Fig. 22
Annals of Otology, Rhinology and Laryngology: Fig. 141
Johann Ambrosius Barth: Figs. 15, 39, 58, 59, 90, 94, 100, 101
General Radio Co.: Table of decibels, Appendix III
Clark University Press: Fig. 160
Electronics: Fig. 7
Harvard University Press: Quotation on p. 17
S. Hirzel: Figs. 38, 42, 91, 104, 105, 116
The Johns Hopkins Press: Figs. 136, 137
Journal of the Acoustical Society of America: Figs. 14, 16, 17, 21, 23, 24, 25,
26, 27, 29, 30, 31, 35, 44, 46, 47, 48, 49, 50, 52, 57, 60, 73, 76, 81, 83, 84, 85,
86, 90, 93, 107, 118, 129, 152, and Table 1
The Journal Press: Figs. 123, 134, 135, 138, 139, 140
Laryngoscope: Fig. 146
National Academy of Sciences: Figs. 72, 78, 79, 80, 82, 87
Physical Review: Figs. 53, 55, 88, 89, 92
Psychological Review Company: Figs. 61, 62, 69
Julius Springer: Figs. 37, 40, 65, 102, 103, 106, 109, 119
University of California Press: Fig. 161
University of Chicago Press: Fig. 162



CONTENTS

                                                                PAGE
Perspective                                                        v
Preface                                                           ix

CHAPTER
 1  The Nature of the Auditory Stimulus                            1
 2  The Sensitivity of the Ear                                    42
 3  Pitch                                                         69
 4  Loudness                                                     110
 5  The Other Attributes of Tones                                160
 6  Auditory Localization                                        167
 7  Aural Harmonics and Combination Tones                        184
 8  Auditory Masking, Fatigue, and Persistence                   208
 9  Modulation, Vibrato, and Beats                               225
10  The Mechanics of the Ear                                     248
11  Deafness and Bone-Conduction                                 288
12  Principles of Neurophysiology                                296
13  The Microphonic Action of the Cochlea                        310
14  Considerations as to the Nature and Origin of Aural
    Microphonics                                                 333
15  The Localization of Frequency Reception on the Basilar
    Membrane                                                     356
16  Auditory Nerve Impulses                                      376
17  Nerve Impulses in Response to Tonal Stimulation              393
18  Nerve Impulses in the Higher Auditory Pathways               414

Appendixes                                                       439
Glossary                                                         449
References                                                       457
Indexes                                                          473



CHAPTER 1


THE NATURE OF THE AUDITORY STIMULUS 

The development of the thermionic vacuum tube has revitalized
the science of acoustics. Whereas it was formerly necessary
to discuss the production, transmission, and recording of sound
energy in terms of purely mechanical devices such as tuning
forks, organ pipes, strings, sirens, tubes, and reeds, we now
treat these problems in terms of electrical and electromechanical
systems such as microphones, amplifiers, and loud speakers.
Before the advent of electronic instruments, the science of sound
had already reached a fullness (Miller, 2)* at the hands of
Helmholtz, Lord Rayleigh, and others which made the nature
of sound waves well understood. Rudolph Koenig had produced
his mechanical masterpieces by which he generated and
measured sound waves of various frequencies and forms. Musical
instruments had for the most part reached their present
stage of evolution, and through the efforts of Sabine a basis for
the exact science of architectural acoustics had been discovered.
Nevertheless, the principal impediment to further progress in
acoustics was the lack of an efficient method for producing and
measuring sounds of any desired frequency, intensity, or
complexity, a deficiency which has now been removed by the
development of electromechanical instruments.

* For explanation of the system of references see p. 457.

This first chapter will consider the principles of sound, as
they bear upon the problem of the understanding and control
of the auditory stimulus, and the nature of electromechanical
systems, in so far as they are utilized in psychological and
physiological investigations.

THE STIMULUS TO HEARING

Any vibratory motion which can be communicated to the
auditory mechanism is capable of arousing auditory sensations.
Stimulation occurs most commonly by sound waves in air, but
occasionally by sound waves in water or directly in the bones
of the head. Sound waves in any medium consist of rapid
vibratory motions on the part of the 'particles' making up the
medium. They are called waves because the motion of one
particle tends to disturb the adjacent particle, which in turn
disturbs the next one, so that a 'wave' of disturbance passes
throughout the medium. Thus in the case of air the rapid
forward movement of the prong of a tuning fork compresses
the air adjacent to the prong, but the elasticity of the air prevents
this localized region of compression from being maintained and
expansion takes place at the expense of compression of the
adjoining region, so that a wave of excess pressure emanates from
the prong. In a similar manner, the backward movement of
the prong sets up a wave of expansion or diminished pressure.
Now, when the prong executes the simplest sort of to-and-fro
motion, so as to generate a pure tone, its behavior may be
described quantitatively in terms of two dimensions: the
frequency of its vibrations and the amplitude of its excursions.
Likewise we can attribute to the resulting tonal stimulus two
physical dimensions, frequency and intensity. Then all the
psychological and physiological phenomena which result from
stimulation by continuous pure tones can be expressed as
functions of these two variables. If a tone is not pure, we must add
another variable to account for the combinations of frequency
and intensity which go to make up the complex tone.

To define the stimulus to hearing as sound is, of course, a
conventional and convenient shorthand. Sound in acoustics
has come to mean the vibrations of bodies or the transmitted
vibrations in media. The rigorous definition of the stimulus
to hearing, however, is much more difficult (Boring, 4) and
[illegible]. When
we analyze hearing into its various aspects, or attributes, such
as pitch and loudness, and ask what is the stimulus to each of
them, the answer is not simple. Each turns out to be a
complicated function of the dimensions of the vibratory disturbance.
Therefore the exact specification of the stimulus to any aspect
of auditory sensation must be expressed in terms of a function
of several variables. The stimulus to loudness, for example,
can be represented in the form

L = f(F, I, C, S)

where F is frequency, I intensity, C complexity, and S is a term
for the stage along the continuum from the vibrating body
through the medium, eardrum, ossicles, etc., at which F, I, and
C are measured. This function must satisfy the criteria (1)
that equal values of the function produce equal magnitudes of
the attribute (pitch, loudness, etc.) and (2) that, where it is
possible to construct quantitative scales for the measurement
of the attribute, the value of the function is proportional to the
magnitude of the attribute as measured on the quantitative
scale.

Needless to say, the complete functions cannot as yet be
written for the stimulus to every aspect or attribute of hearing.
In the chapters which follow we shall, for the most part, hold
C and S constant and express graphically the functions between
the attributes and their stimuli on plots whose coordinates are
frequency and intensity. Hence we shall continue to speak of
sound waves as the stimulus to hearing, but we shall understand
that the precise definition of the stimulus can be given only
after we shall have determined the complete functions relating
each attribute to every dimension of the sound waves.

THE DIMENSIONS OF SOUND 

The word dimension is used here to mean any of what are
commonly called the 'physical' aspects of sound, such as
frequency, energy, velocity, and phase. These are ways in which
sounds may vary, or they are scales in terms of which sounds
may be measured. The 'physical' aspects are commonly
distinguished from the 'subjective' or 'psychological' aspects of
sound, and it is well for our purpose to ascertain upon what
operations or concrete procedures such a distinction rests.

The operations involved in the measurement, and hence in
the definition (Stevens, 6), of the energy of sound consist of
noting the effect of the sound wave on some other physical
system such as a microphone with its associated amplifier and
output meter. On the other hand, the operations involved in
the determination of the loudness of a sound consist of the direct
procedure of noting the effect of the sound on the living
organism. The difference between the two procedures lies in the
fact that in measuring the 'physical' aspect an observer makes
a judgment about a scale reading, whereas in determining the
'subjective' aspect an observer makes a judgment directly about
the sound wave itself as it affects his sense organ. There is
perhaps no reason for considering the one type of observational
judgment as any more basic or 'physical' than the other, except
for the fact that the observations of physics (those which we
call pointer readings) constitute the class of human reactions
which show the greatest uniformity among individuals and
which have therefore been made basic in the exact sciences.
The direct observation of the aspects of a stimulus, without the
aid of instruments, is made with much less precision.
Consequently we can say that the general problem of the psychology
of hearing is that of observing the aspects of sound, as it affects
the organism directly, and comparing the results with observations
of the aspects of sound made with the help of instruments.
This chapter deals with the 'physical' aspects of sound and the
instruments with which we measure them.

THE FUNDAMENTALS OF SOUND 
A convenient aid to the understanding of the physical
dimensions of sound is to relate the motion of the vibrating body
to what is known as the projection of uniform circular motion.
The simplest form of vibratory motion, that executed by the
prong of a tuning fork under proper conditions or by a pendulum
swinging through a small amplitude, is called simple
harmonic motion. Simple harmonic describes the motion of
any body which is displaced from its normal position and then
set free, provided the force needed to displace the body is
proportional to the amount of the displacement. When the force
is proportional to the displacement, the body is said to obey
Hooke's law, and the simple harmonic motion which it
executes is equivalent to the projection of the motion of a point
moving around a circle at a constant rate. Thus in Fig. 1, if



Fig. 1. Showing how a sinusoidal wave is generated by the projection of
circular motion and how two sinusoidal waves may vary in frequency,
amplitude, and phase.

the point P moves around the circle in a counterclockwise
direction, its projection on the vertical axis is represented by
the point R. As P rotates at a constant rate, R moves up and
down the axis of the circle with precisely the same form of
motion as that of the prong of the tuning fork. In order to
represent this motion graphically we have to spread it out, as
shown in the solid curve at the right of Fig. 1; the crests
represent the regions of compression and the troughs the regions of
expansion of the longitudinal sound waves. This curve is
obtained by plotting the distance OR against the angle swept by
the line OP. If we let ω stand for the angular velocity of P
(in radians per second), the angle becomes ωt, where t is the
time elapsed since P was at the point M. Now, the distance OR
is proportional to the sine of the angle ωt, so we speak of simple
harmonic motion as sinusoidal motion and represent it by the
equation

y = A sin ωt

where y is the displacement OR and A is the length of the
vector OP, the distance which is called the amplitude of the
wave. A is the maximum value y can have at any part of
the cycle. The number of times P goes around the circle in one
second is the frequency, and since there are 2π radians in a
complete circle the relation between frequency F and angular
velocity ω is

ω = 2πF

The total time required for the point P to revolve once around
the circle is known as the period T; it is the reciprocal of the
frequency F.
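
The relations y = A sin ωt, ω = 2πF, and T = 1/F can be checked numerically. The following is only an illustrative sketch in Python; the function names and the sample values (a 100-cycle tone of unit amplitude) are assumptions chosen for the example and do not come from the text.

```python
import math

def angular_velocity(frequency):
    """Angular velocity in radians per second: omega = 2 * pi * F."""
    return 2.0 * math.pi * frequency

def period(frequency):
    """Period T in seconds, the reciprocal of the frequency F."""
    return 1.0 / frequency

def displacement(amplitude, frequency, t):
    """Displacement y = A sin(omega * t) of the projected point R at time t."""
    return amplitude * math.sin(angular_velocity(frequency) * t)

# Assumed illustrative values: a 100-cycle tone of unit amplitude.
A = 1.0
F = 100.0
T = period(F)

y_quarter = displacement(A, F, T / 4.0)  # P is a quarter of the way around: crest
y_half = displacement(A, F, T / 2.0)     # P is halfway around: back at rest position

print("period (s):", T)
print("y at T/4:", y_quarter)
print("y at T/2:", y_half)
```

At one quarter of a period the displacement reaches the amplitude A, and at half a period it returns (to within rounding error) to zero, as the circle of reference requires.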

By phase is meant the angle between OP at a particular
instant and the same OP at some other instant which is taken
as the reference or starting point. If two sinusoidal motions
are being considered, we can draw two rotating vectors and
speak of their relative phase as the angle between them at some
selected instant. Thus in Fig. 1 the point P', rotating around
the dotted circle, represents by its projection on the vertical axis
a second sinusoidal wave which leads the first by the phase
angle φ, at the particular instant in question.

If P' rotates twice as fast as P and if it is ahead of P by the
angle φ, at the instant from which we begin to measure time,
the projection of its motion can be represented by the dotted
curve to the right in Fig. 1. Then, if the vibrating body is
executing both motions simultaneously, we can add the dotted
curve to the solid curve and obtain a curve which represents
the resultant motion of the body. Any number of rotating
vectors could be added, with any amplitude and phase, and the
resultant wave would become more and more complex. In
fact, by adding the waves represented by the proper rotating
vectors, any predetermined waveform can be achieved. The
word complexity is used loosely to designate the number of
simple harmonic waves which go to make up a given wave.
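
The addition of rotating vectors described above can be sketched in a few lines of Python. This is a minimal illustration, not a method taken from the text: the amplitudes, the frequencies of 100 and 200 cycles, and the phase lead of π/4 are all assumed values chosen so that the second component rotates twice as fast as the first.

```python
import math

def component(amplitude, frequency, phase, t):
    """One simple harmonic wave: A sin(2*pi*F*t + phase)."""
    return amplitude * math.sin(2.0 * math.pi * frequency * t + phase)

def resultant(components, t):
    """Sum of the component waves at time t (the 'resultant motion')."""
    return sum(component(a, f, p, t) for a, f, p in components)

# Assumed example: P at 100 cycles, P' at 200 cycles leading by pi/4 radians.
components = [
    (1.0, 100.0, 0.0),           # solid curve: amplitude, frequency, phase
    (0.5, 200.0, math.pi / 4.0)  # dotted curve: twice the frequency, phase lead
]

# Sample one period of the slower component (0.01 s) at 80 points.
samples = [resultant(components, n / 8000.0) for n in range(80)]

print("resultant at t = 0:", samples[0])
print("largest displacement in one period:", max(samples))
```

Changing only the phase of the second component alters the shape of the resultant curve while leaving the component frequencies and amplitudes untouched, which is one reason why complexity cannot be ordered on a single scale.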

Unfortunately, there is no simple scale on which we can
order sounds in terms of their dimension of complexity, because
two complex sounds having the same component frequencies
can differ in terms of the phases and amplitudes of the
components. A common procedure is to classify tones in terms of
the ratio of the energy in all the frequencies except the lowest
or fundamental frequency to the energy in all the frequencies
including the fundamental. Thus a tone with 5 per cent
distortion would have 95 per cent of its energy carried by its
fundamental frequency, or would be 95 per cent pure. More
usual, perhaps, is the procedure of defining the percentage of
distortion as the ratio of the sound pressures of the harmonics
to the sound pressure of the fundamental frequency. This
designation of distortion should be used, however, only when
the energies of the harmonics are proportional to the squares
of their pressures.
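
The two ways of stating distortion give different numbers for the same tone, as the following Python sketch suggests. It rests on assumptions that are not in the text: the energy values are invented for illustration, each energy is taken to be proportional to the square of the corresponding pressure, and the pressures of the several harmonics are combined as the square root of the sum of their squares.

```python
import math

def energy_distortion(energies):
    """Fraction of the total energy carried by all components except the
    fundamental. energies[0] is the energy of the fundamental."""
    return sum(energies[1:]) / sum(energies)

def pressure_distortion(pressures):
    """Ratio of the combined pressure of the harmonics to the pressure of
    the fundamental. Combining by root-sum-square is an assumption."""
    combined = math.sqrt(sum(p * p for p in pressures[1:]))
    return combined / pressures[0]

# Assumed example: fundamental with 95 units of energy, two overtones with 4 and 1.
energies = [95.0, 4.0, 1.0]
# Pressures under the assumption that energy is proportional to pressure squared.
pressures = [math.sqrt(e) for e in energies]

by_energy = energy_distortion(energies)
by_pressure = pressure_distortion(pressures)

print("distortion by energy ratio:", by_energy)      # 5 per cent of the energy
print("distortion by pressure ratio:", by_pressure)  # a larger figure, about 0.23
```

In this example the tone that is '95 per cent pure' by energy shows roughly 23 per cent distortion by the pressure ratio, so a stated percentage is meaningful only when the definition in use is also stated.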

For complex tones produced by such generators as a vibrating
piano string the component frequencies are in the ratio
1:2:3:4, etc., to each other and are called harmonic frequencies,
or partials. The first harmonic, or the first partial, is the
fundamental tone; the others are sometimes called overtones.

In our efforts to determine the basic capacities of the ear to
discriminate the aspects of sounds, we try to employ pure tones.
It is doubtful, however, if we ever succeed in generating tones
which have no overtones at all, for even though the vibrating
body which produces the sound waves executes simple harmonic
motion, the air through which the waves must be transmitted
is not perfectly elastic (does not rigidly obey Hooke's
law) and some distortion is produced (Fay). This distortion
is so small, however, compared to the distortion introduced by
the ear itself (see Chapter 7), as to be entirely negligible.
Hence, for all practical purposes, we are able to stimulate the
ear by what may reasonably be called pure tones.

The word velocity as applied to sound usually means one
of two things. It ordinarily refers to the velocity of propagation
of the sound wave, which determines the time required for
a sound produced at one place to be heard at a place some
distance away, and which in air is about 331 meters per second.
We can also speak of the velocity of the individual particle of
air whose motion makes possible the sound wave. On the
circle of reference in Fig. 1 the point R represents this particle,
and the velocity of R at any instant represents what is called the
particle velocity at that instant. The velocity of the particle is
much smaller than the velocity of the wave, for, even though
the particle is in rapid vibration, it moves through such a small
amplitude that its maximum velocity is usually of the order of
a fraction of a millimeter per second. This maximum velocity
occurs at the moment the particle (R in Fig. 1) passes its normal
position of rest.

The length of a sound wave is the distance traversed by the
sound in the time required to complete one cycle. The wave
length is equal to the velocity of the wave divided by its
frequency. Thus in air the wave length of a 1000-cycle tone is
approximately 1 ft, but, since in water the velocity of sound
is about four times as great as in air, the wave length of a
1000-cycle tone in water is about 4 ft. A tone of 1000 cycles
transmitted to the ear through water would have the same pitch as
a tone of the same frequency transmitted through air, provided
the two were at the proper relative intensities. This means that
the dimension of wave length, which differs so greatly in
different media, is of no immediate significance in hearing. It
does, nevertheless, become important when one attempts to
localize the source of a sound (see Chapter 6).
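
The relation between wave length, velocity, and frequency can be
checked numerically. What follows is only a minimal sketch: the
function name is our own, the velocity in air is the figure of 331
meters per second quoted above, and the velocity in water is simply
taken as four times that figure, which is an approximation.

```python
METERS_PER_FOOT = 0.3048

def wave_length(velocity_m_per_s, frequency_cps):
    """Wave length in meters: velocity of propagation divided by frequency."""
    if frequency_cps <= 0:
        raise ValueError("frequency must be positive")
    return velocity_m_per_s / frequency_cps

VELOCITY_AIR = 331.0                   # meters per second, figure used in the text
VELOCITY_WATER = 4.0 * VELOCITY_AIR    # assumption: "about four times as great"

air_ft = wave_length(VELOCITY_AIR, 1000.0) / METERS_PER_FOOT
water_ft = wave_length(VELOCITY_WATER, 1000.0) / METERS_PER_FOOT
print(f"1000-cycle tone in air:   {air_ft:.2f} ft")
print(f"1000-cycle tone in water: {water_ft:.2f} ft")
```

The frequency, and hence the pitch, is the same in both media; only
the wave length changes with the velocity of propagation.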

FORCED VIBRATIONS AND RESONANCE 

Whenever a vibrating body is coupled to another body, that
is to say, connected to it either directly or by way of some
intermediate medium such as air, the motion of the first body is
communicated to the second. A loud noise shakes the window
panes. A vibrating tuning fork, whose base is pressed against
the top of a table, sets the whole table vibrating and thereby
intensifies the sound of the fork. These are examples of forced
vibrations.

If it so happens that the natural period of the forced body is
the same as the period of the first body, the transmitted effect
is greatly intensified. This phenomenon is known as resonance.
Resonance occurs whenever there is impressed upon a body the
frequency at which it would vibrate if set in motion and then
left to itself. Thus when a tuning fork, in the neighborhood
of another fork of identical natural frequency, is struck,
vibrations are set up in the other fork. A more striking experiment
is to hold down the loud pedal of a piano, which frees the
strings so that they can vibrate freely, and sing a brief note.
This excites those strings whose natural frequencies correspond
to the frequencies present in the voice, and a faint replica of the
voice is heard after the singing has ceased. The strings which
vibrate are said to be tuned to the frequencies of the voice.

All methods of recording and reproducing sound depend
upon forced vibrations. The ear itself functions only when
forced into vibration, and, according to Helmholtz, the ear’s
power to discriminate one frequency from another depends
upon what may be regarded as a phenomenon of resonance.
The importance of these principles requires that we examine
them more closely.

In a vibrating system like a tuning fork, three factors determine
what the form of the motion will be. They are the
mass m, the stiffness s, and the resistance r. If the mass is
increased, the vibrations become slower. If the stiffness is
increased, they become faster. And if the resistance to the
motion is increased, the vibrations die out more rapidly and the
system comes to rest sooner. On the assumption of Hooke’s
law, that the restoring force is proportional to the displacement,
it is possible to set up and solve the differential equation which
expresses these facts. The solution shows that the frequency n
of a system for which the resistance is negligible is

$$n = \frac{1}{2\pi}\sqrt{\frac{s}{m}}$$


If the resistance is appreciable, the frequency is decreased
slightly and the amplitude of the vibrations falls off according
to an exponential curve as shown in Fig. 2.
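
The dependence of frequency upon mass, stiffness, and resistance
can be tried out numerically. The sketch below assumes the usual
textbook solution of the mass-spring-resistance equation, in which
the undamped angular frequency is the square root of stiffness over
mass and the damped one is reduced by the square of resistance over
twice the mass. The function names and the numerical values are
illustrative only.

```python
import math

def natural_frequency(mass, stiffness):
    """Frequency (cycles per second) when the resistance is negligible."""
    return math.sqrt(stiffness / mass) / (2.0 * math.pi)

def damped_frequency(mass, stiffness, resistance):
    """Frequency of the free vibration when the resistance is appreciable."""
    damping = resistance / (2.0 * mass)
    under_root = stiffness / mass - damping ** 2
    if under_root <= 0:
        raise ValueError("no oscillation: damping is critical or greater")
    return math.sqrt(under_root) / (2.0 * math.pi)

mass, stiffness, resistance = 0.01, 4000.0, 1.0   # assumed values
print(f"undamped: {natural_frequency(mass, stiffness):.2f} cycles")
print(f"damped:   {damped_frequency(mass, stiffness, resistance):.2f} cycles")
print(f"mass x 4: {natural_frequency(4 * mass, stiffness):.2f} cycles")
```

Quadrupling the mass halves the frequency, quadrupling the stiffness
doubles it, and a moderate resistance lowers it only slightly.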

It is a characteristic of such cases of damped vibration that
the ratio of the height of the crest A to the height of crest B is
the same as the ratio of B to C, of C to D, etc. The constancy
of this ratio makes possible a convenient measure of the rate
of decay of damped vibrations. This measure is derived from





the fact that the amplitude y of the envelope enclosing the
waves in Fig. 2 is given, as a function of the time t, by the
equation

$$y = A e^{-rt/2m}$$



Fig. 2. Showing how a damped vibration declines exponentially. The
heights of the successive crests bear a constant ratio to one another.


The natural logarithm of the ratio
of the height of crest A to that of crest B is known as the
logarithmic decrement per cycle. The equation for the
logarithmic decrement is

$$\delta = \ln\frac{A}{B} = \frac{rT}{2m}$$

so that, if the resistance r, the mass m, and the period T of the
system are known, the logarithmic decrement may be found
directly.

The ratio r/2m is known as the damping factor of the system.
The reciprocal of the damping factor is sometimes called
the modulus of decay or the time constant of the system. This
constant represents the time taken for the amplitude to fall to
a proportion of its initial value equal to the ratio 1/e or 1/2.718.
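
These measures of decay are easily computed. The sketch below
assumes the exponential envelope with damping factor r/2m described
above; the function names and the numerical values are our own and
serve only as an illustration.

```python
import math

def envelope(initial_amplitude, resistance, mass, t):
    """Height of the exponential envelope at time t."""
    return initial_amplitude * math.exp(-resistance * t / (2.0 * mass))

def logarithmic_decrement(resistance, mass, period):
    """Natural logarithm of the ratio of one crest to the next."""
    return resistance * period / (2.0 * mass)

def time_constant(resistance, mass):
    """Reciprocal of the damping factor r/2m."""
    return 2.0 * mass / resistance

r, m, T = 1.0, 0.01, 0.01          # assumed resistance, mass, and period
tau = time_constant(r, m)
crest_a = envelope(1.0, r, m, 0.0)
crest_b = envelope(1.0, r, m, T)
print(f"time constant:          {tau:.3f} sec")
print(f"amplitude at that time: {envelope(1.0, r, m, tau):.4f}")
print(f"log of crest ratio:     {math.log(crest_a / crest_b):.3f}")
print(f"logarithmic decrement:  {logarithmic_decrement(r, m, T):.3f}")
```

After one time constant the envelope has fallen to 1/e of its initial
height, and the logarithm of the ratio of successive crests agrees
with the decrement computed from r, m, and T.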

It is important to note that critical damping occurs when

$$r = 2\sqrt{sm}$$

where s is the stiffness and m the mass of the system.
Under this degree of damping, the system comes back from
its displaced position to its position of rest without once passing
beyond this position. In other words, no free oscillations occur
whenever damping is critical or greater than critical.

Now, in the case of forced vibrations, the system is not
allowed to come to rest in the manner shown in Fig. 2, but is
maintained in oscillation by a periodic force. When, however,
such a periodic force is suddenly applied to the system, it gives
rise to two effects. First of all, on the application of the
external force, a type of oscillation is set up which proceeds
forthwith to die away in the manner shown in Fig. 2. These
oscillations are called transients. After they have died away
there is left a steady state response to the external periodic
force.

These two effects, a transient and a steady state, are present
whenever a system, initially at rest, is set into motion by a
continuous periodic force. The time taken for the system to reach
the steady state depends, of course, upon the speed with which
the transient oscillations are damped out. If the damping is
large, the steady state is reached almost at once. Transient
oscillations occur also when the external driving force is
removed, for the motion of the system must then go through the
sort of exponential decay which characterizes all vibrating
systems in which resistance is present. The frequency of the
transient vibrations is the natural frequency of the system; the
frequency of the steady state vibrations is the frequency of the
driving force. Hence it is of great importance in all systems
for recording or producing sound, such as microphones and
loud speakers and even the ear itself, that the damping be
large, for otherwise the transient frequencies would persist
long enough to interfere with the frequency of the external
force which is being impressed upon the system. What
confusion would result if the elements of the ear continued to
vibrate long after a note had ceased sounding!

When a periodic force acts upon a system and forces it to
oscillate, the amplitude of the oscillations, and hence the
velocity of the vibrating particles, depend upon the relation of the





frequency of the driving force to the natural frequency of
the system. At resonance the two frequencies are the same, and
the velocity of the particles of the forced system is a maximum;
but, as the driving frequency departs more and more from the
natural frequency of the system, the velocity imparted to the
system decreases. By plotting the velocity of the system against
the frequency of the driving force we obtain such resonance
curves as are shown in Fig. 3. Curve A represents a system in
which the resistance is small, and B, a system in which the
resistance is large. In other words, when a system is highly
damped, as represented by curve B, the maximal velocity reached
at resonance is less and the peak of the resonance curve is much
less sharp. Such a system is termed dull. Similar curves
could be drawn to represent the amplitude of a forced system
as a function of the driving frequency, except that the
resonance curves for amplitude are not quite symmetrical and do
not have their maximum at exactly the natural frequency of the
forced system unless the damping is negligible.
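
Resonance curves of this kind can be computed for an idealized
system. The sketch below assumes the standard steady-state solution
for a driven mass-spring-resistance system, in which the velocity
amplitude is the driving force divided by the square root of
r² + (ωm − s/ω)². The natural frequency of 1000 cycles and the two
resistances are illustrative values, not those of Fig. 3.

```python
import math

def velocity_amplitude(force, mass, stiffness, resistance, frequency):
    """Steady-state velocity amplitude of a driven mass-spring-resistance system."""
    omega = 2.0 * math.pi * frequency
    reactance = omega * mass - stiffness / omega
    return force / math.sqrt(resistance ** 2 + reactance ** 2)

NATURAL_FREQUENCY = 1000.0                        # cycles per second (assumed)
MASS = 0.001                                      # illustrative value
STIFFNESS = MASS * (2.0 * math.pi * NATURAL_FREQUENCY) ** 2
FORCE = 1.0
RESISTANCE_A = 0.5                                # lightly damped, like curve A
RESISTANCE_B = 5.0                                # heavily damped, like curve B

for frequency in (500, 800, 900, 1000, 1100, 1200, 1500):
    a = velocity_amplitude(FORCE, MASS, STIFFNESS, RESISTANCE_A, frequency)
    b = velocity_amplitude(FORCE, MASS, STIFFNESS, RESISTANCE_B, frequency)
    print(f"{frequency:5d} cycles   A: {a:6.3f}   B: {b:6.3f}")
```

Both columns reach their maximum at the natural frequency, but the
lightly damped system rises to a much higher and sharper peak, while
the heavily damped one is lower and nearly flat, that is, dull.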

According to one view, the analyzing mechanism of the
inner ear can be treated as if it were equivalent to a row of
resonators, each tuned to a particular frequency. Consequently
it is of interest to note that the curves of Fig. 3 apply equally
well if we consider the abscissa as representing the natural
frequencies of a row of little resonators which are being forced
into vibration by a single alternating force at the frequency of
the middle resonator. The ordinate would then represent the
maximum velocity which each resonator would attain, and the
duller curve would, as before, represent the case for which
the damping is great. (Certain objections which make this



Fig. 3. Showing how a resonant system responds to different
frequencies. For an impressed frequency $F_n$, which is the natural
frequency of the system, the velocity attained is maximal. When
damping is large the resonance of the system is dull, as shown by
curve B.





type of theory of questionable value in the treatment of the ear
will be considered later. Cf. Appendix II.)

From a consideration of all the factors which determine the
behavior of a system like that hypothesized for the ear, it is
clear that nature must make compromises. The ideal of high
selectivity would demand that each resonator respond vigorously
to its own frequency, but only slightly to other frequencies.
This requirement would mean sharp tuning and small damping.
With small damping, however, it takes a longer time for
the system to come to a steady state in response to an applied
force, for the initial and final transients tend to persist and
interfere. Thus, if the elements in the inner ear were resonators,
and were no more highly damped than the strings of
a piano when the loud pedal is pressed, we should find ourselves
able, perhaps, to distinguish smaller differences in frequency
than we can at present, but we should pay for it with
extreme annoyance due to sensations which refuse to die out.
The obvious failure of auditory sensations to persist is proof of
the great damping in the inner ear (see also Chapter 10).
Clearly, nature has, for the ear, effected an excellent
compromise.

DISTORTION UNDER FORCED VIBRATION 

When a simple harmonic force is applied to a system, simple
harmonic oscillations are set up, provided the system is linear,
or, in other words, provided the displacement of the system is
proportional to the applied force. For linearity to obtain, it
is also necessary that the system be symmetrical in that it moves
from its position of rest as easily in one direction as in the
opposite. Thus a responding system (an ear, a microphone, a loud
speaker, a phonodeik, or a tuning fork) introduces amplitude
distortion if it is unsymmetrical or does not obey Hooke’s law
in both directions.

The reason for this distortion can be illustrated by an
example. If, instead of the displacement being proportional to
the force, we assume it to be proportional to the square of the





force, we can represent the relationship between force and dis- 
placement by the ‘characteristic’ curve in Fig. 4. (In a linear 
system this would have been a straight line.) Then if we apply 
a sinusoidal force, as shown at the bottom of the diagram, the 
resulting displacement will be represented by the curve at the 



Fig. 4. Showing how a nonlinear relation between force and displacement
generates a wave containing harmonic components. A sinusoidal applied
force produces a resulting displacement which can be analyzed into the waves
shown by the dotted curves.

right of the diagram. This curve is clearly not sinusoidal. In
fact, it can be shown mathematically to be equivalent to the sum 
of the two dotted curves plus a constant factor (which need 
not concern us). In other words, both the first and the second 
harmonics are present in the response of a system in which a 
square-law relation obtains between force and displacement. 

In the event that the characteristic curve for the system is
not limited to the square law, but requires higher terms for its
representation, the equation for the displacement becomes

$$y = a + bf + cf^2 + df^3 + \cdots$$





where y is displacement, f is force, and a, b, c, etc., are constants.
In this case the higher harmonics are introduced, as can be
shown if we substitute for f a sinusoidal force ($f = f_0 \sin \omega t$)
and carry out the trigonometric reduction.

The really interesting effect to be obtained from a nonlinear
system occurs when two sinusoidal frequencies are applied
simultaneously. Here again it can be demonstrated mathematically
that the result is a conglomeration of frequencies set up
in the system. Thus, if the frequencies m and n are applied
together (m > n), the resulting motion of the system will be
compounded of the frequencies m, n, (m − n), (m + n), 2m,
2n, (2m − n), (2n − m), etc. These component frequencies are
not mere mathematical fictions; their physical existence can be
shown by means of a system of resonators tuned to each one.
If the system is the ear, their presence is confirmed by the
existence of the so-called subjective overtones, difference tones,
summation tones, etc. (see Chapter 7).
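
The appearance of these combination frequencies can be demonstrated
numerically. The sketch below is only an illustration under stated
assumptions: the system is a hypothetical one whose displacement is
y = bf + cf², the applied frequencies are 700 and 500 cycles, and
each component is measured by correlating one second of the response
with a sine and a cosine at the frequency in question. None of the
names or values comes from the text.

```python
import math

SAMPLE_RATE = 4000        # samples per second (assumed)
N = SAMPLE_RATE           # one second of signal

def response(force, b=1.0, c=0.5):
    """Displacement of a hypothetical nonlinear system: y = b*f + c*f**2."""
    return b * force + c * force ** 2

def component_amplitude(samples, frequency, sample_rate=SAMPLE_RATE):
    """Amplitude of the sinusoidal component at one frequency."""
    re = 0.0
    im = 0.0
    for k, y in enumerate(samples):
        angle = 2.0 * math.pi * frequency * k / sample_rate
        re += y * math.cos(angle)
        im += y * math.sin(angle)
    return 2.0 * math.hypot(re, im) / len(samples)

m, n = 700, 500           # applied frequencies in cycles per second (assumed)
samples = []
for k in range(N):
    t = k / SAMPLE_RATE
    force = math.sin(2 * math.pi * m * t) + math.sin(2 * math.pi * n * t)
    samples.append(response(force))

labels = {"n": n, "m": m, "m - n": m - n, "m + n": m + n,
          "2n": 2 * n, "2m": 2 * m}
amplitudes = {name: component_amplitude(samples, freq)
              for name, freq in labels.items()}
for name, freq in labels.items():
    print(f"{name:6s} {freq:5d} cycles   amplitude {amplitudes[name]:.3f}")
```

With only a square-law term the response contains m, n, the
difference and summation frequencies, and the second harmonics.
Components such as 2m − n and 2n − m would require a cubic or higher
term in the characteristic.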

In addition to amplitude-distortion we may be concerned
in electrical and mechanical systems with frequency-distortion
and phase-distortion. Frequency-distortion occurs whenever a
system responds unequally to different frequencies. Thus the
curves of Fig. 3 represent a system in which there is
frequency-distortion, because the response of the system to the resonant
frequency is clearly different from that to any other frequency.
In the design of an instrument, such as a high quality microphone,
in which frequency-distortion is to be minimized, it is
necessary, therefore, to avoid having the instrument tuned to
any of the frequencies to be recorded. A practical rule (A. H.
Davis) is that, to obtain a true record of the relative amplitudes,
the natural frequency of the instrument must be 5 to 10 times
the highest frequency to be recorded. This insures that the part
of the resonance-curve which will be utilized is the relatively
flat part far removed from the resonance peak.

Phase-distortion occurs when the response of the system does
not preserve the phase relations between the components of the
applied force. This type of distortion is not very serious in
acoustical systems, because, with slight exceptions (see Chapter





7), the ear is not sensitive to differences of phase between the
components of a complex tone.

INTERFERENCE 

A very simple principle underlies the phenomenon of
interference. If two equal forces are applied to a particle from
opposite directions they cancel each other and the net effect is
nil. Thus, if two tuning forks of the same frequency are
sounded in such a way that a wave of compression from each of
them reaches a point midway between them at precisely the
same instant, their effects on a particle at that point are canceled.
But, if a wave of compression from one fork coincides with a
wave of expansion from the other, the effect on the particle is
doubled. Now, if the frequencies of the two forks differ by a
slight amount (2 cycles, let us say), their waves will cancel
each other the part of the time that they are out of phase, and
reinforce each other the rest of the time when they are in phase.
Consequently, a person listening to the forks experiences a
periodic waxing and waning of the sound at a frequency of 2
per second. Such periodic changes in intensity are called beats
(see Chapter 9).
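
The rate of beating follows from the trigonometric identity
sin A + sin B = 2 sin ½(A + B) cos ½(A − B): the sum of two tones of
nearly equal frequency is a tone at the mean frequency whose
amplitude waxes and wanes. The sketch below assumes two forks of
equal strength at 256 and 258 cycles; these values and the function
names are our own.

```python
import math

def beat_frequency(f1, f2):
    """Number of waxings and wanings per second."""
    return abs(f1 - f2)

def beat_envelope(f1, f2, t):
    """Slowly varying amplitude of sin(2*pi*f1*t) + sin(2*pi*f2*t)."""
    return abs(2.0 * math.cos(math.pi * (f1 - f2) * t))

f1, f2 = 258.0, 256.0     # assumed fork frequencies, 2 cycles apart
print(f"beats per second: {beat_frequency(f1, f2):.0f}")
for t in (0.0, 0.125, 0.25, 0.375, 0.5):
    print(f"t = {t:5.3f} sec   envelope = {beat_envelope(f1, f2, t):.3f}")
```

The envelope is greatest at t = 0, vanishes a quarter of a second
later, and is restored at half a second, so that two complete beats
occur in each second.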

It is not necessary, however, to have two sources of sound in
order to demonstrate interference. Indeed, whenever a tone is
produced inside a room, the waves which are reflected by the
walls return to interfere, and diminish or reinforce the sound,
depending upon the phase relations between the direct and the
reflected waves. This phenomenon gives rise to many serious
problems in acoustics. Perhaps the most drastic consequence
of interference due to reflected waves is that it becomes
impossible to know the intensity of a tone at a given point in a room
merely by knowing the intensity of the tone at the source. In
free space sound waves behave as light waves: the intensity
decreases inversely as the square of the distance from the source,
but, the moment it becomes possible for a wave reflected by
some surface or object to reach the point at which the intensity
is being measured, the simple relationship of the square law
no longer holds. Since almost all smooth, rigid wall surfaces





are even more efficient reflectors of sound than mirrors are of
light, the difficulties in the way of predicting the behavior of
sound in a closed room are comparable to what they would be
for light if every surface in the room were a polished mirror.
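
For reference, the inverse square law which holds in free space, and
which fails as soon as reflected waves are present, may be written
out as a short computation. The function name and the distances
chosen are illustrative only.

```python
def relative_intensity(distance, reference_distance=1.0):
    """Intensity in free space relative to that at the reference distance."""
    if distance <= 0 or reference_distance <= 0:
        raise ValueError("distances must be positive")
    return (reference_distance / distance) ** 2

for distance in (1.0, 2.0, 4.0, 10.0):
    print(f"{distance:5.1f} units from the source: "
          f"{relative_intensity(distance):.4f} of the reference intensity")
```

Doubling the distance reduces the intensity to one quarter. No such
simple rule can be given for a room with reflecting walls.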

A continuous steady tone released in a room quickly sets
up a pattern of standing waves, that is to say, a pattern of
cancellations and reinforcements which results in the intensity of
the sound being very different at different places in the room.
By moving the head slightly when a continuous tone is sounding
in a room, marked changes in loudness are readily experienced.

Among the first to call attention to the problem of
interference was W. C. Sabine. He wrote the following:

In order to show this [the effect of standing waves] in a definite
manner, I have measured the intensity in all parts of a certain laboratory
room. It was found, near the source, even at the source itself, the
intensity was in reality less than at a distance five feet from the source.
And yet the clever experimenter Wien, and no less skilled psychologists
Wundt and Münsterberg, have assumed, under similar conditions, the
law of variation of intensity with the inverse square of the distance.

Not only do the walls reflect sound in such a way that it becomes
many times more intense than it otherwise would be, but even the
total quantity of sound emitted by the source itself may be greatly
affected by its position with regard to the interference system of the room.

It is thus necessary in quantitative research in acoustics to take
account of three factors: the effect of reflection by the walls on the
increase in the total intensity of the sound in the room, the effect of
interference in greatly altering the distribution of this intensity, and the effect
of the reaction of the sound vibrations in a room upon the source
itself.

In choosing a source of sound, it has usually been assumed that a
source of fixed amplitude was also a source of fixed intensity, e.g., a
vibrating diaphragm or a tuning fork electrically maintained. On the
contrary, this is just the sort of source whose emitting power varies
with the position in which it is placed in the room [because, if the source
is placed at a point of reinforcement in the pattern of standing waves, its
vibrations are more effective in imparting energy to the surrounding
medium].

The remedy to be applied when it is desired to stimulate the 





ear in a closed room and at the same time maintain adequate
control of the intensity of the sound is either to prevent, by the
use of earphones, the sound from running loose in the room,
or to treat the walls with an absorbent material which will
prevent reflection. This procedure is analogous to painting black
the walls of a room in which light is to be used for experimental
purposes. By the proper selection of materials and the
exercise of care in placing the source and the listener, the amount of
reflected sound reaching the listener can be made negligible
compared to that which reaches him directly. Nevertheless it is
impossible completely to duplicate the conditions of 100-per-cent
absorption which characterize free space.

The effect of increasing the absorption of the walls of a room
is to decrease the apparent average intensity of the sound from
a given source, because the intensity which normally results
from the sound’s reflecting back and forth from wall to wall
for a period of time after it has left the source is absent.
Likewise, when the source is suddenly extinguished, the sound dies
out sooner. Under these conditions we say that the reverberation
time of the room has been decreased. Reverberation is
probably the most important single concept in the new science
of architectural acoustics, for the satisfactoriness of an
auditorium as a place for listening to speech and music is greatly
dependent upon the length of time a sound persists after it has
left the source. This time must be neither too short nor too
long.

Caution should be voiced concerning the use of tubes for the
purpose of conducting sound to the ears, for standing waves are
set up as readily in tubes as in rooms. Perhaps the most
satisfactory method for conducting sound waves by tubes is to
employ the equivalent of a tube of infinite length and use a
short side tube (short compared to the wave length of the sound)
to conduct the sound from the main tube to the ear. A practical
‘infinite tube’ can be obtained by taking about 20 ft of
garden hose and lining it throughout its length with some
absorbent material, such as mohair. Strange as it may appear,
if an unlined tube is left open at the end, reflection will occur.





THE ANALYSIS OF SOUND 

As we have already seen, simple harmonic motions can be
added together to produce a complex motion. What is more
important, however, is the fact that any complex periodic
motion can be analyzed into a series of simple harmonic
components. The truth of this statement can be demonstrated
mathematically by Fourier’s theorem. For our purposes this



Fig. 5. Showing how a complex wave may be analyzed into Fourier
components in harmonic relation. (Miller, 1.)

theorem may be stated as follows: given any periodic motion
having a fundamental frequency n, the same motion can be
reduced to one particular set of simple harmonic motions of
suitable amplitudes and phases whose frequencies are n, 2n, 3n,
4n, etc. Thus the tone emitted from a violin may appear on an
oscillograph as the upper curve in Fig. 5. This curve can then





be analyzed into the components represented by the curves
numbered 1 to 3. In general, the greater the number of
components taken, the more faithful is the reproduction of the
original curve when the components are put back together.
Several ingenious mechanical devices have been developed both
for discovering the components of the wave, once its form is
known, and for synthesizing the components to recover the
original wave form.
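
What the mechanical analyzers and synthesizers accomplish can be
imitated by computation. The sketch below is an illustration under
stated assumptions: the wave analyzed is a made-up one built from
three harmonics (it is not the violin tone of Fig. 5), one period is
sampled at 360 points, and the amplitude and phase of each harmonic
are found by summing the products of the samples with a cosine and a
sine of the harmonic frequency.

```python
import math

N = 360  # samples taken across one period of the wave

def wave(x):
    """A made-up complex periodic wave with three harmonic components."""
    return (1.00 * math.sin(x)
            + 0.50 * math.sin(2 * x + 0.3)
            + 0.25 * math.sin(3 * x + 1.0))

samples = [wave(2 * math.pi * k / N) for k in range(N)]

def analyze(samples, harmonic):
    """Return (amplitude, phase) such that the component is
    amplitude * sin(harmonic * x + phase)."""
    n = len(samples)
    a = 2.0 / n * sum(y * math.cos(2 * math.pi * harmonic * k / n)
                      for k, y in enumerate(samples))
    b = 2.0 / n * sum(y * math.sin(2 * math.pi * harmonic * k / n)
                      for k, y in enumerate(samples))
    return math.hypot(a, b), math.atan2(a, b)

components = [analyze(samples, h) for h in (1, 2, 3)]

def synthesize(parts, x):
    """Add the given components back together at the point x."""
    return sum(amp * math.sin((index + 1) * x + phase)
               for index, (amp, phase) in enumerate(parts))

def rms_error(count):
    """Root-mean-square difference between the wave and the sum of
    its first `count` components."""
    total = 0.0
    for k in range(N):
        x = 2 * math.pi * k / N
        total += (samples[k] - synthesize(components[:count], x)) ** 2
    return math.sqrt(total / N)

for count in (1, 2, 3):
    amp, phase = components[count - 1]
    print(f"component {count}: amplitude {amp:.3f}, phase {phase:.3f}; "
          f"error using {count} component(s): {rms_error(count):.4f}")
```

The error of the reconstruction falls as each component is added and
vanishes when all three are put back together.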

An analysis of a sound wave could be made directly by
means of a series of resonators (such as Helmholtz’s resonators),
each tuned to a different frequency. Each component
frequency of the complex sound would then activate only that
resonator to which it is tuned, and the composition of the sound
could be determined by noting which resonators respond and
how much. The obvious disadvantage of this method is the
inconvenience of providing a sufficient number of resonators
adequately to cover the range of frequencies. Consequently
it is little used in practice.

The ear, however, does its analyzing as though it contained
just such a series of resonators (see Chapter 10). As a result,
the ear is in general able to detect the presence of component
frequencies in a sound wave and to identify their pitch, provided
they are not too numerous or too faint. This interesting fact
is known as Ohm’s acoustical law. The mechanical and
physiological mechanisms underlying this law will be dealt with later.

Since most of the sounds with which we are concerned in
modern acoustics are either produced electrically or can be
readily converted into an electric current by means of a
microphone and amplifier, the analysis of sound can be reduced in
practice to the analysis of electric currents. Electrical
wave-analyzers have the important advantage that continuous tuning
can be employed so that every component of the wave can
be detected and measured. Many types have been developed
(Hall, 2). A typical commercial analyzer consists, functionally,
of two parts. The first is a voltmeter tuned to respond
to a single frequency only; the second is a means of changing
this tuning so that any desired frequency in the audible range





can be measured. The operation of an analyzer will be
discussed later (p. 39).

FREQUENCY AND ITS MEASUREMENT 

Frequency is the measure of the number of times per second
that a vibrating particle executes a complete cycle, as illustrated
by the circle of reference (Fig. 1). Hence, frequency has come
to be measured in cycles (per second) rather than in double or
single vibrations, as was formerly the custom. A cycle is two
single vibrations or one double vibration. Contrary to what is
customarily stated in textbooks on physics, frequency is not
synonymous with pitch. Pitch is determined by a direct
observation of an aspect of sound as it affects the ear, whereas
frequency is an observation of an aspect of sound which we are
obliged to perform with the help of instruments. Such
instruments are known as tonometers, or frequency meters.

The mechanical types of tonometers which have been
devised for the measurement of frequency are varied and
ingenious. They are based in general upon the principle of
sounding a tone of known frequency, which is nearly the same
as that of the frequency to be measured, and determining the
difference between them. Datta lists fifteen schemes which
have proved more or less successful. Their chief limitation lies
in the fact that they are generally useful only for the
measurement of low frequencies.

In modern acoustics, as in radio engineering, electrical
methods have come into use for the measurement of frequency
because of their convenience and the wide range of frequencies
to which they are adequate. Since most of the tones used in
psychophysiological studies are produced by electrical generators,
their frequency can be determined by measuring the frequency
of the electric current which activates the generators. Thus, in
a simple commercial frequency meter, the electric current to
be measured is led to an electric circuit (a bridge) consisting
of resistances and capacitances, so arranged that, when their
values are properly adjusted, the circuit is tuned to the
frequency of the current. Then, where the values of the
resistances and capacitances are known, the frequency can be
calculated. In practice, however, it is customary to calibrate
the meter so that the frequency can be read directly from a dial.

The calibration of modern frequency meters is made in
terms of some primary standard of frequency. A typical
standard consists of an electrical oscillator to generate an electric
current at a frequency of approximately 50,000 cycles. This
frequency is fixed by the characteristics of a vibrating bar of
quartz, which is made to vibrate by the electric current. In
addition there is another oscillator, known as a multivibrator,
which generates an electric wave that is rich in harmonics.
The frequency of the multivibrator is adjusted until some one
of its harmonics coincides with the frequency of the quartz
oscillator, and then, since the other harmonics are in fixed
ratios, their relative frequency is known. From a standard
of this sort it is possible to obtain frequencies equal to any of
the harmonics of the multivibrator. The accuracy of these
frequencies is better than 1 part in 1,000,000.

One of the harmonic frequencies, 1000 cycles (very closely),
is then made to drive a synchronous clock, similar to the electric
clocks used in the home, except that it is driven by a current of
higher frequency. The clock generates a sharp pulse of
current every thousand cycles (once a second), which can be
compared with the radio time signals sent out by an observatory.
Thus the frequency of the current driving the clock can be
determined. Briefly stated, this method consists essentially of
running an electric clock by means of a frequency known only
approximately and comparing the time told by the clock with
the time determined by an observatory. The difference, if any,
measures with extreme accuracy the amount by which the
unknown frequency differs from the frequency for which the
clock was designed (1000 cycles in this case). Through the
careful application of these methods, the frequency of an electric
current, which formerly was known with much less accuracy
than such circuit constants as electrical resistance, has come to be
the most exactly determined constant of all.
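
The arithmetic of this comparison is simple. In the sketch below the
clock is supposed to advance one second for every 1000 cycles it
receives; the function name and the example, a gain of 0.0864 second
in one day, are assumptions chosen for illustration.

```python
def actual_frequency(nominal_cps, clock_seconds, observatory_seconds):
    """Frequency of the current driving the clock.

    nominal_cps          frequency for which the clock was designed
    clock_seconds        time told by the clock over the interval
    observatory_seconds  true length of the interval
    """
    if observatory_seconds <= 0:
        raise ValueError("the interval must be positive")
    cycles_delivered = nominal_cps * clock_seconds
    return cycles_delivered / observatory_seconds

ONE_DAY = 86400.0                       # seconds by the observatory
clock_reading = ONE_DAY + 0.0864        # assumed: clock gains 0.0864 sec per day
frequency = actual_frequency(1000.0, clock_reading, ONE_DAY)
print(f"driving frequency: {frequency:.6f} cycles")
print(f"departure from 1000 cycles: {(frequency / 1000.0 - 1.0) * 1e6:.2f} parts per million")
```

A gain of less than a tenth of a second in a day thus corresponds to
an error of one part in a million in the driving frequency, which is
why a long comparison gives so exact a determination.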

Once a single frequency is known, other frequencies which 





are in harmonic relation to it can be determined with great
precision and convenience by means of a cathode ray
oscillograph (see p. 40). If the known frequency is impressed upon
the horizontal plates of the oscillograph so as to deflect the beam
of electrons back and forth in the horizontal plane, and at the
same time a frequency is impressed upon the vertical plates,
the result is a geometrical pattern, known as a Lissajous figure,



Fig. 6. Lissajous figures obtained by a pair of sinusoidal motions, one
vertical, the other horizontal, whose frequencies are in certain ratios and
phase relations. The ratios express the relation of the frequency of the vertical
to that of the horizontal waves. (The columns of the figure are labeled by
phase: 0°, 45°, 90°, 135°, 180°.)

which remains stationary provided the two frequencies bear
some simple numerical relation to each other. Thus, if the two
frequencies are identical, a circle, an ellipse, or a straight line
is obtained, depending upon the relative phase of the two waves.








If they are in the ratio 2 : 1, the figure is a semicircle, or figure
eight. If the frequencies are not quite in the proper ratio, the
figure changes at a rate which can be counted when the
divergence is not too great. Typical Lissajous figures are shown
in Fig. 6.
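
The forms taken by these figures can be verified by computing the
path of the spot. The sketch below assumes two sinusoidal
deflections of equal amplitude; the function and its parameters are
our own, with the ratio expressing the vertical frequency relative
to the horizontal, as in Fig. 6.

```python
import math

def lissajous(ratio, phase_degrees, points=360):
    """Points (x, y) traced during one cycle of the horizontal motion.

    ratio          vertical frequency divided by horizontal frequency
    phase_degrees  phase of the vertical motion relative to the horizontal
    """
    phase = math.radians(phase_degrees)
    path = []
    for k in range(points):
        theta = 2.0 * math.pi * k / points
        x = math.sin(theta)                   # horizontal deflection
        y = math.sin(ratio * theta + phase)   # vertical deflection
        path.append((x, y))
    return path

line = lissajous(1, 0)      # equal frequencies in phase: y = x
circle = lissajous(1, 90)   # equal frequencies in quadrature: x^2 + y^2 = 1
arc = lissajous(2, 90)      # ratio 2 : 1 at 90 degrees: y = 1 - 2x^2
eight = lissajous(2, 0)     # ratio 2 : 1 in phase: a figure eight

print("straight line:", all(abs(x - y) < 1e-12 for x, y in line))
print("circle:       ", all(abs(x * x + y * y - 1.0) < 1e-12 for x, y in circle))
print("open arc:     ", all(abs(y - (1.0 - 2.0 * x * x)) < 1e-12 for x, y in arc))
```

With a ratio of 2 : 1 the figure closes into a figure eight at a
phase of 0° and collapses into a single open arc at 90°, the spot
retracing the same curve in both directions.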

When it is desirable to record frequencies which are
continuously varying, another type of meter can be used (Hunt).
This meter incorporates an ingenious method of indicating
directly on the dial of an ammeter the frequency of an electric
current at any instant. Each time the alternating current
crosses the zero axis (reverses sign) it trips a pair of discharge
tubes and allows a pulse of current of predetermined size to
flow through an ammeter. Then, since all the pulses are the



Fig. 7. The relation between the musical scale and frequency. (After
Henney.)


same size, the reading on the ammeter is directly proportional
to their number per second, and hence directly proportional to
the frequency of the current. Thus, simply by calibrating its
dial, the ammeter becomes a direct reading frequency meter.
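
The principle of counting reversals of sign can be imitated with a
sampled wave. The sketch below is not a model of the meter itself;
it merely counts the sign reversals in one second of a sampled
440-cycle sine wave (an assumed test tone) and halves the count,
since the current crosses the zero axis twice in every cycle.

```python
import math

def count_zero_crossings(samples):
    """Number of times the wave reverses sign."""
    crossings = 0
    for previous, current in zip(samples, samples[1:]):
        if (previous < 0) != (current < 0):
            crossings += 1
    return crossings

def frequency_from_crossings(samples, sample_rate):
    """Two crossings occur per cycle, so frequency = crossings / (2 * duration)."""
    duration = len(samples) / sample_rate
    return count_zero_crossings(samples) / (2.0 * duration)

SAMPLE_RATE = 48000       # samples per second (assumed)
TONE = 440.0              # cycles per second (assumed test tone)
samples = [math.sin(2.0 * math.pi * TONE * k / SAMPLE_RATE + 0.1)
           for k in range(SAMPLE_RATE)]
print(f"indicated frequency: {frequency_from_crossings(samples, SAMPLE_RATE):.1f} cycles")
```

Because the count is taken over a fixed interval, the indication is
proportional to frequency, just as the ammeter reading is
proportional to the number of pulses per second.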

The musical scale is, of course, a scale of frequency. Thus
Fig. 7 shows the frequency of the notes of the musical scale, and
the range of frequencies covered by common musical instruments
and the human voice. The particular scale illustrated
here is the one in which all the C's are powers of 2. This scale
is not, however, in general use by musicians. The notes of the
scales which are commonly used differ in frequency by only
a few cycles from the scale in Fig. 7.
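
As a small illustration of how close the two kinds of scale lie, the sketch below compares middle C in a scale whose C's are powers of 2 with middle C in an equally tempered scale. The tuning A = 440 cycles and the octave numbering (middle C as octave 4) are assumptions introduced for the example and are not taken from Fig. 7.

```python
def c_power_of_two(octave):
    """Frequency of C in the scale whose C's are powers of 2 (middle C = 2**8)."""
    return 2.0 ** (octave + 4)

def c_equal_tempered(octave, a4=440.0):
    """Frequency of C in equal temperament tuned to A = a4 (440 assumed)."""
    return a4 * 2.0 ** (-9 / 12) * 2.0 ** (octave - 4)

middle_c_physical = c_power_of_two(4)     # 256 cycles
middle_c_tempered = c_equal_tempered(4)   # about 261.6 cycles
difference = middle_c_tempered - middle_c_physical
```

Near middle C the two differ by less than 6 cycles; the difference doubles with each higher octave.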

INTENSITY 

The use of the word intensity to denote an aspect of sound
has perhaps entailed more ambiguity than any other single
practice in acoustics. It is important, therefore, that we clarify
the meaning of the word and adopt some standard, unambiguous
usage of it.

First of all, we must understand that intensity is not synonymous
with loudness. Intensity, like frequency, is one of the
physical aspects of sound which we are able to observe only
with the aid of instruments. Loudness, like pitch, is one of
the aspects which we observe directly. Furthermore, there is
no one-to-one correlation between pitch and frequency nor between
loudness and intensity.

Although the definition of frequency is a relatively simple
matter, the definition of intensity is not so readily achieved, for
reasons which we shall discover. Perhaps for our purpose a
satisfactory general definition of the intensity of a plane sinusoidal
sound wave in air is as follows: Intensity is the other
variable besides frequency whose value must be specified in
order completely to determine the sound wave. This is equivalent
to saying that sound is two-dimensional and that intensity
is one of the dimensions. There are several alternative ways
of specifying the second variable once the frequency has been
determined. In other words, intensity is a generic term designating
a class of alternative ways for specifying a physical
aspect of sound.

As we have seen, the propagation of a sound wave in air
involves rapid alternating displacements of the air particles, and
is associated with oscillatory changes in the pressure and velocity
of the air. In addition, the propagation of the wave entails a
transfer of energy through the air, and ends by exerting a tiny





which vary sinusoidally, it is customary to take as the measure
of the quantity its root-mean-square value. A straight average
of a sinusoidal function would, of course, be zero, for the function
is below the axis as much as it is above the axis. By
squaring all values of the function before averaging, the negative
sign of the part below the axis is eliminated. If the
function is strictly sinusoidal, the rule is that the root-mean-square
value is the maximum value (the amplitude) divided by
the square root of 2.
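
The rule can be verified on a sampled sinusoid. The sketch below is illustrative only; the function name and the choice of 1000 samples per cycle are assumptions.

```python
import math

def mean_and_rms(amplitude, n=1000):
    """Straight average and root-mean-square of one full cycle of a sinusoid."""
    values = [amplitude * math.sin(2 * math.pi * k / n) for k in range(n)]
    mean = sum(values) / n
    rms = math.sqrt(sum(v * v for v in values) / n)
    return mean, rms

mean, rms = mean_and_rms(3.0)   # amplitude 3 in any unit
expected = 3.0 / math.sqrt(2)   # amplitude divided by the square root of 2
```

The straight average comes out as zero to within rounding, while the root-mean-square value agrees with the amplitude divided by the square root of 2.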

Care should be taken not to confuse the alternating pressure
of a sound wave with the radiation pressure, which is the pressure
exerted by the wave on an object which it strikes. This
radiation pressure is unimportant, practically, as a measure of
intensity. It is, in fact, directly proportional to the energy of
the sound wave.

Energy is expressed in two ways, either as the average energy
per unit volume or as the average rate of flow of energy through
a unit area. The rate of flow of energy is simply the average
energy per unit volume times the velocity of sound. The equation
for the rate of flow of energy J is usually written

$$ J = 2\pi^2 f^2 a^2 \rho c $$

where the symbols have the same meaning as in Fig. 8 (f is the
frequency, a the amplitude of the particle displacement, ρ the
density of the air, and c the velocity of sound). Hence,
the energy of a sound wave is proportional to the square of the
frequency times the square of the amplitude (of the particles).
The pressure of a sound wave, however, is proportional to the
frequency times the amplitude, as shown in Fig. 8. It follows,
then, that the energy is proportional to the square of the alternating
pressure, or in symbols

$$ J = \frac{p^2}{\rho c} $$

where p stands for the root-mean-square value.
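
The agreement between the two ways of reckoning the flow of energy can be checked numerically. The sketch below assumes the standard relations for a plane progressive wave, namely a flow of energy 2π²f²a²ρc and a peak pressure 2πfaρc; the symbols, the round values for the density of air and the velocity of sound, and the sample frequency and amplitude are assumptions made for the illustration, in c.g.s. units.

```python
import math

RHO = 0.0012   # density of air, g per cm^3 (assumed round value)
C = 34_400.0   # velocity of sound, cm per sec (assumed round value)

def flow_from_displacement(freq, amplitude):
    """Rate of flow of energy from frequency and displacement amplitude."""
    return 2 * math.pi ** 2 * freq ** 2 * amplitude ** 2 * RHO * C

def rms_pressure(freq, amplitude):
    """R.m.s. alternating pressure: peak value 2*pi*f*a*rho*c over sqrt(2)."""
    return 2 * math.pi * freq * amplitude * RHO * C / math.sqrt(2)

def flow_from_pressure(p_rms):
    """Rate of flow of energy from the r.m.s. pressure."""
    return p_rms ** 2 / (RHO * C)

j_displacement = flow_from_displacement(1000.0, 1e-8)
j_pressure = flow_from_pressure(rms_pressure(1000.0, 1e-8))
```

Both values are in ergs per second per square centimeter, and they coincide only because a plane progressive wave was assumed.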

It must be emphasized that these relations between displacement,
velocity, pressure, and energy obtain for a plane progressive
sound wave. In many situations, especially where standing
waves occur, the energy is not proportional to the square of the





pressure, and under these circumstances the various measures
of the strength of a sound-wave cannot be used indiscriminately.
This consideration has led, in acoustics, to the adoption of a
definition of sound-intensity in terms of the rate of flow of
energy through a unit area of the medium (Frederick).
This definition (see Glossary) should be used whenever the
energy is not proportional to the square of the pressure of a
sound-wave.


THE DECIBEL 

The difficulty of deciding upon the best measure of intensity
is somewhat obviated by the modern convention of expressing
intensity as a ratio rather than as an absolute magnitude. The
need for this procedure grew out of the problems involved in
the transmission of electric waves over networks, a major
problem of the telephone-engineer. Thus, when an impulse
is sent over a wire, its intensity diminishes as it progresses, and
at the receiving end it is smaller than it was at the sending end.
What was needed was a convenient method of expressing the
magnitude of the impulse at the terminus in terms of its magnitude
at the beginning, and the logarithm of the ratio of the
energy at the terminus to the energy at the beginning was taken
as the measure. In honor of the inventor of the telephone, the
logarithm (to the base 10) of this ratio defines the number of
bels comprising the ratio. In practice it is more common to
measure the ratio of two energies in decibels (abbreviated db).
A decibel is one-tenth of a bel.

The number of decibels is thus defined as 10 times the
logarithm of the ratio of two energies or powers, but decibels
can also be used to designate the ratio of two pressures, velocities,
voltages, currents, etc., which are related to the flow of
energy by a square law. If N represents the number of decibels,
we have

$$ N = 10 \log_{10} \frac{E_1}{E_2} = 20 \log_{10} \frac{p_1}{p_2} $$

where the E's are energies and the p's are pressures. Since the
decibel is defined in terms of the ratio of two energies, care must





be taken in applying the formula to the ratio of two currents or
two voltages. The currents or voltages must always be measured
across the same impedance or the formula does not hold.
In practice, we are usually interested in comparing two voltages
impressed upon the same loudspeaker, or other device, and in
this case we can properly say that the number of decibels between
the two voltages is 20 times the logarithm of their ratio.

Properly used, the decibel has several advantages. Since it is
a logarithmic unit, it is convenient for the representation of the
great range of intensities encountered in acoustics. The ear
can support a sound whose energy is about a million million
times the least energy it can detect. This huge difference can
be expressed as 120 db. A second advantage is that we can add
decibels instead of multiplying. Thus if we have two amplifiers
in series and each of them amplifies the voltage tenfold, we
add 20 db for each amplifier and obtain the result that the total
amplification is 40 db. A third advantage is that the decibel
scale provides what is essentially a common denominator for
expressing intensities. In other words, if, from a certain reference
point, the intensity of a plane progressive sound wave is
increased 60 db, as determined by measurements of energy, it
has also been increased 60 db as determined by measurements
of pressure or particle velocity.
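
The three advantages can be put in figures. The following sketch simply evaluates the two forms of the decibel formula for the cases mentioned in the text; the function names are introduced here for illustration.

```python
import math

def db_from_energy(e1, e2):
    """Decibels between two energies or powers: 10 log10(E1/E2)."""
    return 10 * math.log10(e1 / e2)

def db_from_pressure(p1, p2):
    """Decibels between two pressures (or two voltages across the same
    impedance): 20 log10(p1/p2)."""
    return 20 * math.log10(p1 / p2)

range_of_ear = db_from_energy(1e12, 1.0)          # a million million to one
two_amplifiers = 2 * db_from_pressure(10.0, 1.0)  # tenfold voltage gain, twice
by_pressure = db_from_pressure(1000.0, 1.0)       # pressure raised 1000-fold
by_energy = db_from_energy(1000.0 ** 2, 1.0)      # the same change, as energy
```

The last two values agree because, for a plane progressive wave, the energy varies as the square of the pressure.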

Clearly the measurement of sound intensity in decibels presupposes
a standard or reference intensity, for the decibel scale
must always represent the relation of one intensity to another.
A common point of reference is the threshold of hearing, the
least energy necessary to arouse an auditory sensation. Where
possible, however, it is better to measure intensity in decibels
above the value which has been proposed as the reference
intensity, namely, $10^{-16}$ watt per square centimeter (see Glossary).
(The watt is a measure of the rate of flow of energy.)
For a plane sound wave in air, this intensity is approximately
equivalent to the root-mean-square pressure of 0.0002 dyne per
square centimeter, or a particle velocity of 0.000005 cm. per sec.,
and is reasonably close to the average threshold of hearing for a
1000-cycle tone (see Chapter 2). A scale representing the approximate
intensity of familiar sounds in decibels above the
reference intensity is shown in Fig. 9.
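
The equivalence between a reference intensity of 10^-16 watt per square centimeter and the quoted pressure and particle velocity can be reproduced approximately. The sketch below assumes a plane progressive wave and round values for the density of air and the velocity of sound; these constants and the function names are assumptions for the illustration.

```python
import math

RHO_C = 0.0012 * 34_400.0     # density (g/cm^3) times velocity (cm/sec), assumed
ERGS_PER_SEC_PER_WATT = 1e7   # one watt is 10^7 ergs per second

def rms_pressure_from_intensity(watts_per_cm2):
    """R.m.s. pressure (dyne/cm^2) of a plane wave of the given intensity."""
    j = watts_per_cm2 * ERGS_PER_SEC_PER_WATT   # ergs per sec per cm^2
    return math.sqrt(j * RHO_C)

def particle_velocity(p_rms):
    """R.m.s. particle velocity (cm/sec) of a plane wave."""
    return p_rms / RHO_C

p_ref = rms_pressure_from_intensity(1e-16)   # close to 0.0002 dyne/cm^2
v_ref = particle_velocity(p_ref)             # close to 0.000005 cm/sec
```

The small departures from the round figures reflect the assumed constants, which vary with temperature and barometric pressure.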
Decibel scales will be used freely in the later chapters,
and our ability to grasp their meaning will be greatly aided
if we keep certain points in mind. Most important, we
should remember that the decibel scale is a logarithmic
scale expressing the magnitude of the ratio between two
quantities. Then, as a practical matter, it is well to remember
certain useful equivalents. When we refer to sound pressure,
or to the voltage applied to a loudspeaker, a tenfold increase
means the addition of 20 db and a twofold increase means
the addition of very nearly 6 db. By remembering that
every time we multiply the pressure by 10 we must add 20 db
and every time we multiply by 2 we must add 6 db, it is
relatively easy to find the approximate number of decibels
corresponding to any pressure ratio. For example, a pressure
increase of eightyfold is an increase of 10 × 2 × 2 × 2, or the
addition of 20 + 6 + 6 + 6 db, which equals 38 db. (For a table
relating pressure ratios to decibels see Appendix III.)

Fig. 9. Loudness-levels, in decibels, of various common sounds.
(Sounds shown, from the top of the scale downward: thunder,
airplane engine, boiler shop, elevated train, pneumatic drill,
busy street, conversation, quiet automobile, average office,
average dwelling; 0 db marks the threshold of hearing.)
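
The rule of thumb for pressure ratios (20 db for each factor of 10, 6 db for each factor of 2) can be written out as a short procedure and compared with the exact logarithmic value. This is a sketch for illustration; the function name is hypothetical.

```python
import math

def approximate_db(pressure_ratio):
    """Build up a pressure ratio from factors of 10 (20 db) and 2 (6 db).

    Returns the rule-of-thumb decibel value and whatever part of the
    ratio is left over after the factors have been removed.
    """
    db, remaining = 0, float(pressure_ratio)
    while remaining >= 10:
        remaining /= 10
        db += 20
    while remaining >= 2:
        remaining /= 2
        db += 6
    return db, remaining

rule_of_thumb, leftover = approximate_db(80)   # 10 x 2 x 2 x 2
exact = 20 * math.log10(80)
```

For an eightyfold increase the rule gives 38 db with nothing left over, and the exact value differs from it by less than a tenth of a decibel.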


THE MEASUREMENT OF INTENSITY

Drysdale has reviewed the difficulties which have tended
to thwart the development of acoustic measurements and instruments.
He makes the following points:

1. Acoustic power is an extremely minute quantity; the





power usually available for measurement is but a small fraction
of a microwatt. (One microwatt = 10 ergs per second.)

2. Acoustic power is not generally transmitted along definite
paths, like electric currents in conductors, but is radiated
in all directions so that only a certain portion of the total power
emitted by the source can be picked up.

3. The wave length of most audible tones is of the order of
a few inches to a few feet, and is therefore neither very large
nor very small compared with the dimensions of most recording
instruments. On this account the reactions of the instrument
on the waves are extremely troublesome, and it becomes difficult
to obtain a true sample for measurement.

4. Owing again to the size of the wave length, interference
phenomena and reactions between source and receiver are likely
to be embarrassing.

5. The construction of surfaces which will completely absorb
all sound which reaches them is difficult, if not impossible.
Hence, reflections and standing-wave patterns arise to cause
unsuspected errors.

These points, although serious, are not insurmountable.
They represent, however, stubborn facts which every acoustical
engineer must face when he sets out to measure sound.

Lord Rayleigh gave us what is undoubtedly the best practical
method for the absolute measurement of sound intensity.
He observed that a light disk suspended in a sound field tends
to set itself at right angles to the direction of the sound. Consequently
it becomes possible to develop a formula relating the
force needed to turn the disk (the turning moment) to the rate
of flow of the air past the disk. The turning moment is proportional
to the mean-square velocity of the air particles. A
Rayleigh disk, properly constructed and employed, can be used
as a standard in terms of which other instruments, such as
microphones, can be calibrated.

Practical measurements of sound intensity in the laboratory
are nearly always made by means of a calibrated microphone,
that is to say, a microphone for which the relation between the
current generated and the pressure on the diaphragm has been





determined for all frequencies to be measured. When the relation
of current to pressure is known, the microphone is said
to have a pressure-calibration. Then, by placing the microphone
in a sound field and measuring the current generated,
we can compute the value of the sound pressure at the face of
the microphone.

A complication arises when we place an ordinary microphone
in a sound field, because the presence of the microphone
tends to distort the field, and the pressure on the diaphragm is
not what the pressure of the sound was at the same place before
the instrument was placed there. To obviate this difficulty, we
resort to a field-calibration, that is to say, we determine the intensity
of the field, perhaps by means of a Rayleigh disk, and
then insert the microphone and determine the current generated.
Then, by placing the microphone in an unknown field,
properly oriented with respect to the direction of the sound
wave, we can determine the intensity which would exist in
the unknown field at that point if the microphone were not
there.

In an effort to obtain a microphone which will not distort
the sound field and which will give the same response regardless
of the direction of the sound wave, many types of instrument
have been developed. In order not to distort the field in any
way, the microphone must be made infinitely small, but the
ideal of a nondirectional microphone (one which responds
equally well to sound from all directions) can be achieved, in
practice, by a properly designed baffle around the microphone
(Marshall).

Similar considerations apply to the distortion and directionality
consequent upon placing the head in a sound field.
Thus, in general, two procedures are possible for the measurement
of such items as minimum audible intensities. The intensity
of the field may be determined without the presence of
the human observer, and this intensity taken as the intensity
of the stimulus. In this case the head must be oriented in some
standard manner towards the source of sound. Or the intensity
of the sound at the eardrum of the observer may be determined





after he is in the field, a procedure which involves certain difficulties,
however (see Chapter 2).


THE ELECTRICAL ANALOGY 
Throughout the preceding discussion, attention has frequently
been called to electrical devices and methods. Not
only does the modern science of acoustics owe its recent advances
to the invention of practical instruments for the production
and measurement of sound waves, but the theory of sound
itself has been extended with the aid of the methods of analysis
which were originally developed to deal with electrical phenomena.
A mechanical or acoustical system is composed of
elements whose relationships can be treated by the same form
of differential equations which are basic to the analysis of electric
circuits. Each variable of the equations in the theory of
circuits has its analogue in a variable of the corresponding
mechanical equation. The conventional analogy between some
of these variables is as follows:


Mechanical                                      Electrical

Force                            resembles      Emf
Velocity                             "          Current
Mass                                 "          Inductance
Frictional resistance                "          Resistance
Compliance (i.e., 1/stiffness)       "          Capacity
Displacement                         "          Quantity of charge


An alternative analogy has been suggested by Firestone, who
points out some advantages not possessed by the conventional
analogy. In other words, there is nothing sacred or in any
sense fundamental about the identification of mass and inductance,
for we could equally well construct the equations in
such a way that mass and capacity would play analogous roles.
Nevertheless, the use of these analogies, conventional or
otherwise, has led to many elegant treatments of acoustical
problems, and the student of acoustics finds the study of circuit
theory an essential element of his training. Thus the marked
progress in the acoustics of absorbent materials, in the design





of acoustic filters, in the construction of all sorts of electroacoustic
transducers for the conversion of sound into electric energy,
and vice versa, has hinged upon the development and use of
such functions as acoustic impedance, which, like electric impedance,
is a concept of great utility and power. For an extensive
application of these notions, the reader is referred to modern
works on acoustics.
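
One consequence of the conventional analogy is that a single formula serves both kinds of system. The sketch below uses the familiar expression for the natural frequency of a lossless system, once read mechanically (mass and compliance) and once electrically (inductance and capacity). The numerical values are hypothetical and were chosen only so that the two readings coincide.

```python
import math

def natural_frequency(inertia, compliance):
    """Natural frequency of a lossless system, 1 / (2*pi*sqrt(inertia*compliance)).

    Mechanical reading: inertia = mass, compliance = 1/stiffness.
    Electrical reading: inertia = inductance, compliance = capacity.
    """
    return 1.0 / (2 * math.pi * math.sqrt(inertia * compliance))

# Mechanical: 0.01 kg on a spring of stiffness 1000 newtons per meter.
f_mechanical = natural_frequency(0.01, 1 / 1000.0)
# Electrical analogue with the same numbers: 0.01 henry and 0.001 farad.
f_electrical = natural_frequency(0.01, 0.001)
```

Under the alternative analogy the roles of the elements would be exchanged, but the same differential equation, and hence the same computation, would still apply.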

ELECTRICAL APPARATUS 

The purpose of this section is to sketch the characteristics
of the electrical apparatus most commonly encountered in
modern experiments in hearing. The descriptions are not of
particular experimental set-ups, but apply to what might be
called an idealized set of equipment. All such apparatus has
been used in the studies to be reported in later chapters.



Fig. 10. The elements of an experimental set-up designed to produce and
record any desired tones.

We may begin by referring to Fig. 10, which is a schematic
diagram of an ideal arrangement for the investigation of psychophysiological
acoustics. The upper system is for the generation
of pure tonal stimuli at any desired frequency and intensity.
The lower system is for the recording, measuring, and analyzing
of sound. We shall consider each element in turn.














It is perhaps desirable to consider first the amplifier, for
oscillators are essentially a special type of amplifier. The core
of the amplifier is the vacuum tube. The vacuum tube, called
in England by the suggestive name valve, is simply a device in
which a small electrical potential is able to control the behavior
of a large current. It is like a valve whose opening is regulated
by a force which is very small compared to those which its
opening may bring into play. The vacuum tube is connected
in the circuit as shown diagrammatically in Fig. 11. The filament
F is heated by current from the battery A. The heating
so agitates the electrons in the filament that some of them
fly off into the surrounding space, and, since they are
negative charges, they are drawn towards any body possessing
a positive charge. Such a body is the plate P, which is
positively charged by being connected to the battery B. Now,
the stream of electrons which flows from the filament (cathode)
to the plate (anode) is very sensitive to any charge it might
meet on the way. Consequently, a perforated grid G is placed
between the filament and the plate, and any charge induced on
the grid is effective in determining the number of electrons that
are able to get across. Then, when the voltage to be amplified
is impressed on the grid, and makes the grid swing negative, the
electrons are hindered in their passage; when it makes the grid
less negative, the electrons are accelerated. The result is that
each change in the grid voltage is reflected in the current
through the resistance R and hence in the voltage across R.
This output voltage can then be led to the grid of another tube,
and so on until the original voltage has been amplified millions
of times. There are numerous ways of coupling one stage to
another, each with its particular advantage for certain types of
service. For most work in acoustics the ideal amplifier is one



Fig. 11. The elements of a
tube amplifier.






which amplifies all frequencies equally well, and does so without
introducing amplitude-distortion due to nonlinearity. The
latter ideal is met in an amplifier, as in a mechanical system,
when the response of the system (output voltage) is proportional
to the impressed force (input voltage) symmetrically in
both directions. If the ideal is not met, harmonics are introduced
into the output.
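
The appearance of harmonics can be demonstrated with a simple numerical experiment. In the sketch below a pure sinusoid is passed through two hypothetical response curves, one strictly proportional and one slightly curved, and the amplitude of the second harmonic in the output is computed by correlating the output with a wave of twice the input frequency. The response curves and the function name are assumptions made for the illustration and describe no particular amplifier.

```python
import math

def harmonic_amplitude(response, harmonic, n=1024):
    """Amplitude of one harmonic in the output when the input is sin(t)."""
    re = im = 0.0
    for k in range(n):
        t = 2 * math.pi * k / n
        y = response(math.sin(t))
        re += y * math.cos(harmonic * t)
        im += y * math.sin(harmonic * t)
    return 2 * math.hypot(re, im) / n

def linear(x):
    """Output strictly proportional to input (gain of 5)."""
    return 5.0 * x

def curved(x):
    """Hypothetical asymmetric response: gain of 5 plus a small square term."""
    return 5.0 * x + 0.5 * x * x

second_linear = harmonic_amplitude(linear, 2)   # no second harmonic
second_curved = harmonic_amplitude(curved, 2)   # second harmonic appears
```

The proportional response yields no second harmonic, whereas the curved response yields one of amplitude 0.25, half the coefficient of the square term.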

An oscillator is simply an amplifier in which there is provision
for leading back to the grid circuit a small part of the
power in the output circuit. The part led back, provided that
it arrives at the grid circuit in the proper phase, is then amplified
again to produce more power in the output circuit, from which
another small part can be led back to repeat the process all over
again. The phase and the frequency at which the impulses are
sent back to the grid circuit to be reamplified are determined
by a tuned circuit, that is, a circuit consisting of inductance and
capacity adjusted to give it the desired natural frequency. This
natural frequency can then be conveniently regulated simply
by changing the capacity of the circuit.

In order to obtain any frequency in the audible range by
merely turning the dial of one variable condenser, it is necessary
to resort to the method of beat frequencies. Two oscillators
are constructed with natural frequencies of the order of 100,000
cycles, and the frequency of one of them is controlled by a
variable condenser. The outputs of both oscillators are led
to a single mixer tube where they interact to produce beats.
The beats, whose frequency is equal to the difference between
the frequencies of the two oscillators, are then amplified and led
to the output of the instrument. Thus, if one oscillator is
tuned to 100,000 cycles and the other to 101,000 cycles, the frequency
obtained from the output will be 1000 cycles. The two
high frequencies do not appear in the output because they are
filtered out by the proper arrangement of the circuits. An alternative
name for this type of oscillator is heterodyne oscillator.
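
The advantage of the beat-frequency method can be seen in figures. The sketch below assumes the usual expression for the natural frequency of a circuit of inductance and capacity, 1/(2π√(LC)), and a hypothetical coil of one millihenry; it finds the capacities that tune the two oscillators to 100,000 and 101,000 cycles and the resulting beat.

```python
import math

def tuned_frequency(inductance, capacity):
    """Natural frequency (cycles per sec) of a circuit of inductance and capacity."""
    return 1.0 / (2 * math.pi * math.sqrt(inductance * capacity))

def capacity_for(frequency, inductance):
    """Capacity that tunes the circuit to the given frequency."""
    return 1.0 / ((2 * math.pi * frequency) ** 2 * inductance)

L_HENRY = 1e-3                                   # hypothetical coil
c_fixed = capacity_for(100_000.0, L_HENRY)       # fixed oscillator
c_variable = capacity_for(101_000.0, L_HENRY)    # variable condenser setting

beat = abs(tuned_frequency(L_HENRY, c_variable) - tuned_frequency(L_HENRY, c_fixed))
change_in_capacity = 1 - c_variable / c_fixed    # fractional change required
```

A change of about 2 per cent in the capacity moves the beat from zero to 1000 cycles, which is why a single small condenser suffices to cover the audible range.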

The third element of our producing system might well be a
voltmeter to measure the output of the amplifier and provide assurance
that its behavior is constant.





Next, in order to impress any desired voltage on the loudspeaker,
an attenuator is introduced. An attenuator is a network
of resistances designed to reduce the voltage and at the
same time keep the other circuit constants, such as impedance,
unchanged. An attenuation network is usually constructed
in sections, each section designed to decrease the voltage by
a certain number of decibels. Then, simply by adding in or
taking out sections, any degree of attenuation can be obtained.
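
Since the sections are rated in decibels, the total attenuation is found by addition, and the voltage follows from the total. The sketch below illustrates this; the particular set of sections is hypothetical.

```python
def attenuate(voltage, sections_db):
    """Voltage remaining after a chain of attenuator sections.

    Each section removes a fixed number of decibels; the decibels of
    the sections in circuit add, and the voltage ratio is 10**(-db/20).
    """
    total_db = sum(sections_db)
    return voltage * 10 ** (-total_db / 20), total_db

# Hypothetical box with 20, 10, 5, 2 and 1 db sections; switch in 20 + 5 + 1.
output_voltage, total = attenuate(1.0, [20, 5, 1])
```

With 26 db in circuit, one volt at the input is reduced to about 0.05 volt at the output.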

If the output of the amplifier contains undesired harmonics,
they can be eliminated by a properly designed filter. A filter
is a network which offers more impedance to the passage of
some frequencies than others. The elementary notion of a
filter can be illustrated if we imagine an inductance placed
across the output of our amplifier. Now, since an inductance
offers small impedance to low and large impedance to high
frequencies, the low frequencies would be shunted through the
inductance, and only the high frequencies would be passed on
to the speaker. On the other hand, if a condenser is placed
across the output, it shunts out the high frequencies and lets
the low frequencies pass by. So, by the proper combinations
of inductances and capacitances, filters can be designed to pass
any desired frequencies and stop all others.
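
The opposite behavior of the two elements can be put in figures by means of the standard expressions for the magnitude of their impedance, 2πfL for an inductance and 1/(2πfC) for a condenser. The element values in the sketch below are hypothetical.

```python
import math

def inductance_impedance(freq, henries):
    """Magnitude of the impedance of an inductance, 2*pi*f*L (ohms)."""
    return 2 * math.pi * freq * henries

def condenser_impedance(freq, farads):
    """Magnitude of the impedance of a condenser, 1/(2*pi*f*C) (ohms)."""
    return 1.0 / (2 * math.pi * freq * farads)

# Hypothetical shunt elements: 0.1 henry and 1 microfarad.
coil_low = inductance_impedance(100.0, 0.1)        # about 63 ohms
coil_high = inductance_impedance(10_000.0, 0.1)    # about 6300 ohms
condenser_low = condenser_impedance(100.0, 1e-6)       # about 1600 ohms
condenser_high = condenser_impedance(10_000.0, 1e-6)   # about 16 ohms
```

The coil offers the easy path at 100 cycles and the condenser at 10,000 cycles, so that each, placed across the output, shunts away the frequencies to which it offers the least impedance.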

An analogous theory can be applied to the construction of
tubes which have openings and side tubes that filter certain
frequencies out of a sound wave. These are called acoustic
filters (Stewart, 5).

Finally, we arrive at the electroacoustic transducer. This
may be any sort of loudspeaker or receiver. So many varieties
have been developed (magnetic, moving-coil, thermoelectric,
piezoelectric, condenser, eddy-current, ribbon, etc.) that their
description cannot be undertaken here. The ideal high-quality
generator of sound is one which responds equally to currents
of all frequencies in the audible range. Another requirement,
and one which is realized in most types of instrument, is that the
alternating pressure of the sound wave generated be proportional
to the impressed voltage. When this is the case, we have
simply to measure the voltage, and we have a relative measure




of the sound intensity. This relationship is true, of course, only
when we release the sound in an absorbent room where interference
phenomena are absent.

Next we shall consider the recording system. A microphone
is essentially a receiver which is made to work backwards. It
converts sound-energy into electric energy. As might be expected,
there are about as many types of microphones as there
are receivers, and in general the same ideals must be met by a
high-quality instrument, namely, uniform response at all frequencies
and a response (voltage output) proportional to the
pressure (or velocity) of the sound wave. As we have previously
noted, an additional requirement might be that the
microphone be nondirectional and produce negligible distortion
when it is placed in a sound field.

From the amplifier, which amplifies the response of the
microphone, the current may be led to one or all of the instruments
indicated in Fig. 10. The voltmeter would permit us to
determine the intensity of the sound, provided the microphone
had been calibrated and we knew the gain of the amplifier.



Fig. 12. The elements of a recording wave-analyzer.

The frequency meter would determine the frequency of the
sound. If the sound were complex and it was desired to know
its frequency composition, a wave-analyzer would be needed.
An automatic analyzer (Hall, 1) is shown schematically in
Fig. 12. A microphone picks up the sound wave to be analyzed
and delivers it as an electric wave to the mixer, which performs
the same function as the mixer in the beat-frequency oscillator.
The other wave led to the mixer comes from an oscillator designed
to produce frequencies between 20,000 and 30,000 cycles.








The mixer then produces a frequency which is the difference
between that of the sound at the microphone and that coming
from the oscillator, and sends it on to a filter (a magnetostrictive
rod) tuned very sharply to 20,000 cycles. No other frequency
can pass this filter. Therefore, nothing gets through
to the rectifier unless the difference between the frequency of a
component of the wave from the microphone and the frequency
from the oscillator is 20,000 cycles. Thus, when the microphone
picks up 500 cycles, the filter passes a wave only when the
oscillator produces a frequency of 20,500 cycles. The trick then
is to make the oscillator sweep from 20,000 to 30,000 cycles continuously,
and note when a frequency gets through the filter
and is recorded on the cathode-ray oscillograph. The dial of
the oscillator can be driven by a motor and the response on
the oscillograph can be recorded on a moving film, so that a
permanent record is made of the intensity of each component
frequency in the wave picked up by the microphone.
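
The logic of the sweeping analyzer can be sketched in a few lines. The model below is highly idealized: the filter is taken to be infinitely sharp, the oscillator moves in steps of 10 cycles, and the complex tone is hypothetical. It shows only how each component is picked out at the oscillator setting that lies 20,000 cycles above it.

```python
FILTER = 20_000  # cycles; the sharply tuned filter

def analyze(components, sweep=range(20_000, 30_001, 10)):
    """Sweep the oscillator and record what gets through the filter.

    `components` maps each component frequency (cycles) to its intensity.
    Returns {component frequency: intensity} as read off the sweep.
    """
    record = {}
    for oscillator in sweep:
        for freq, intensity in components.items():
            if oscillator - freq == FILTER:
                record[oscillator - FILTER] = intensity
    return record

# Hypothetical complex tone: a 500-cycle fundamental and two overtones.
spectrum = analyze({500: 1.0, 1000: 0.5, 1500: 0.25})
```

The 500-cycle component is recorded when the oscillator stands at 20,500 cycles, the 1000-cycle component at 21,000 cycles, and so on across the sweep.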

The purpose of a cathode-ray oscillograph is to reproduce in
a visual pattern the characteristics of an electric wave. Fig. 13
is a schematic representation of the essential parts of a
cathode-ray oscillograph. One type consists of a glass tube,
evacuated to a high vacuum, in the small end of which is
a filament (cathode) which, upon heating, emits electrons.
These electrons are drawn violently forward by an anode
which bears a positive charge of a thousand volts or more.
In the anode is a small hole
through which some of the electrons pass with sufficient velocity
to carry them on to the large end of the tube. Thus a small
stream of electrons is shot from one end of the tube to the
other, like water from a garden hose, and each time an electron
strikes the large end of the tube, which is coated with a
fluorescent material, it makes a little 'splash' of light. Hence



Fig. 13. The elements of a cathode-ray
oscillograph.






one sees a bright spot where the stream strikes the fluorescent
screen. Now, in order to obtain a pattern, it is merely necessary
to deflect the stream by passing the electrons between a
pair of metal plates on which there is an electric charge that
can attract or repel them, depending upon the sign of the
charge. Two pairs of plates, vertical and horizontal, permit
deflection of the beam in both directions.

In order to obtain a true representation of an electric wave,
it is customary to employ a device known as a sweep-circuit. A
sweep-circuit impresses on the vertical plates a potential that
grows at a constant rate and pulls the spot horizontally across the
face of the tube. Then suddenly the potential falls to zero
and the spot flashes back to the other side of the screen, where
it starts another trip across the tube at a constant speed. The
action of a sweep-circuit depends upon the alternate charging
and discharging of a condenser through a gaseous discharge
tube. Now, with the spot traveling uniformly across the face
of the tube, it becomes a simple matter to impress the potential
to be studied (the signal) upon the horizontal plates and
thereby make the spot move up and down at the same time.
The net result is a true picture of the electric wave.
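
The combined action of sweep and signal can be imitated by computing the position of the spot at successive instants. In the sketch below the horizontal position rises at a constant rate during each sweep, and the vertical position follows the signal; the function name, the number of points, and the two frequencies are assumptions made for the illustration.

```python
import math

def trace(signal, signal_freq, sweep_freq, n=200):
    """Spot positions (x, y) during one sweep across the tube.

    x grows at a constant rate from 0 towards 1 (the sweep potential,
    which would then fall back to zero); y follows the potential
    being studied.
    """
    points = []
    for k in range(n):
        t = k / (n * sweep_freq)          # time since the sweep began
        x = (t * sweep_freq) % 1.0        # sawtooth: constant-rate rise
        y = signal(2 * math.pi * signal_freq * t)
        points.append((x, y))
    return points

# A 1000-cycle sine wave viewed with 500 sweeps per second: two cycles per trip.
picture = trace(math.sin, 1000.0, 500.0)
```

When the frequency of the signal is a whole multiple of the sweep frequency, successive trips fall upon one another and the picture stands still.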

The study of electric and acoustic phenomena has been tremendously
aided by the cathode-ray oscillograph, and it is perhaps
no exaggeration to say that much of the recent work on
the physiology of hearing would have been impossible without
it.



CHAPTER 2 


THE SENSITIVITY OF THE EAR 

The absolute sensitivity of the ear is determined by the minimal
energy or sound pressure needed to excite a sensation of hearing.
This amount of energy is called the threshold value. In addition
to the various practical problems which arise when we set out
to determine the threshold of hearing, such as the apparatus
and procedure to be used, two fundamental questions must be
decided: first, the form of energy, electrical or mechanical, that
we shall take as the stimulus, and, second, the point in the long,
continuous process from the original generator of the energy
to the final experience of tone at which we shall choose to
measure the energy. Obviously, very different results would
follow from measuring at, say, the generator and at the eardrum,
or even in the auditory nerve. Different experimenters have
selected different points for the measurement of the least audible
energy, but in general, for the determination of the threshold
of hearing for sound waves in air, two types of measurement
have predominated: (1) the minimum audible sound pressure
at the eardrum, and (2) the minimum audible sound field in
which the observer is placed for listening.

MINIMUM AUDIBLE PRESSURES 

A direct measurement of that very small intensity of the
sound wave at the eardrum which will just elicit a sensation of
hearing is practically impossible. The pressure at the threshold
is so small that there has been devised no method sufficiently
sensitive to permit its direct measurement at the eardrum. Most
methods, therefore, have to establish a known pressure at some
measurable intensity, well above threshold, and then determine
the amount by which the pressure must be reduced in order to
reach the threshold value. Two principal types of procedure


have been used for the determination of the sound pressures at
the eardrum.

1. A telephone receiver, a thermophone, or an electrodynamically
driven piston is employed as the source of sound and
is held tightly to the ear in such a way as to enclose a known
volume of air. The receiver is calibrated, that is to say, the
sound-energy emitted by the receiver into a closed chamber of
known volume is determined either by the direct optical measurement
of the motion of the diaphragm or by the measurement
of the electrical power consumed in the receiver, or else it is
computed from the response of a calibrated microphone placed
in the chamber. Then, knowing the output of the source of
sound and the volume enclosed between it and the eardrum,
one can compute the pressure on the drum. This procedure
requires that certain assumptions be made. It is necessary to
assume that the pressure throughout the ear-enclosure is uniform,
and it is usually desirable to assume also that the walls of
the ear-canal are rigid, although some experimenters have made
allowance for the yielding of the drum itself.

Fig. 14. ... various experimenters. (After Sivian and White.)

Several experimenters have employed a procedure in which
a receiver is fitted tightly to the ear. The curves in Fig. 14
which were determined in this manner are those of Minton and
Wilson, Wegel, Riesz, and Blackman, Békésy, Kranz, and
Fletcher and Wegel. The curve for Meyer is the average of
data obtained with a telephone receiver and with the ear in an
open sound field. Consequently, although Meyer reported no
systematic difference between the two methods, it is difficult to
say whether or not his results should be grouped with those
obtained solely by the method of the tightly fitting receiver.

2. The second method for determining minimum audible
pressures employs a small ‘search tube’ connected at one end
to a calibrated microphone. The other end is inserted into the
ear so that it is fairly close to the eardrum. Then, if the search
apparatus is calibrated, the pressure at the end of the tube near
the drum can be determined, and, on the assumption that the
pressure at the drum is the same as at the end of the tube, the
minimum audible pressure on the eardrum can be measured.
Here again it is necessary to measure the pressure at some value
well above threshold and then to determine by how much the
sound must be attenuated to reach threshold. The data represented
in Fig. 14 by the curves for Sivian and for Munson were
determined by this method.
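
Both procedures rest on the same arithmetic: a pressure is established at a measurable level, and the attenuation needed to reach threshold is then applied to it. The following is a minimal sketch in Python, assuming the attenuator reads in decibels of pressure (20 times the logarithm of the pressure ratio). The function name and the sample values are hypothetical and are not taken from any of the experiments cited.

```python
def threshold_pressure(calibrated_pressure, attenuation_db):
    """Return the pressure at threshold, in the units of calibrated_pressure.

    calibrated_pressure: pressure established well above threshold.
    attenuation_db: attenuation (db of pressure) needed to reach threshold.
    """
    return calibrated_pressure * 10 ** (-attenuation_db / 20)


# Hypothetical example: 1 dyne per square centimeter, attenuated by 60 db.
example = threshold_pressure(1.0, 60.0)
print(example)
```

The calculation is valid only so long as the ear and the receiver behave linearly over the range of attenuation, which is the assumption the text makes for both methods.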

THRESHOLD AT LOW FREQUENCIES

Measurement of the absolute sensitivity of the ear at
frequencies below 50 cycles is dependent upon an ability to
generate extremely pure tones. The characteristics of the
thermophone are such that activation by two sinusoidal currents
produces a sinusoidal tone whose frequency is the difference
between the frequencies of the two currents. Hence, low tones
of large amplitude can be produced by allowing pure alternating
currents to beat in a thermophone (Békésy, 24). Using
this device as a source of sound, Békésy (22) was able to report
the measurement of auditory thresholds (minimum audible
pressures) at extremely low frequencies. His results disclose,
furthermore, that at frequencies below 50 cycles the basic quantal
nature of the auditory process manifests itself in a step-like
threshold curve. By raising the frequency of a tone slowly
and continuously from about 2 to about 50 cycles, one can
observe that the loudness and the pitch of the tone do not change
evenly, but by jumps. Figure 15 shows a threshold curve
obtained by the following procedure. Beginning with about 2
cycles at a high intensity, the experimenter decreased the
intensity until nothing was heard and then increased the frequency
until the sensation reappeared. Thereupon the intensity was
decreased (stepwise) until the tone ceased to be heard, and again
the frequency was raised continuously until the sensation was
reported. Below 4 and above 50 cycles no steps could be detected,
but between these frequencies steps occurred with regularity at
the frequencies shown in Fig. 15 (4.5, 6, 7.5, 9, 11, 14, 18, 22, 28,
32, and 38 cycles). Note that the curve in Fig. 15 agrees well
with what would be a reasonable extrapolation of the curves
of Fig. 14 (cf. Fig. 19).

Fig. 15. The minimum audible pressures for low frequencies. This
threshold curve shows the step-like character which may indicate the
quantal nature of the processes involved. The most prominent step
occurs at 18 cycles. (After Békésy, 22.)

The threshold step at 18 cycles is the most readily detectable.
Approaching it from a lower frequency, the observer experiences
at 18 cycles a sudden increase in loudness and in pitch.
In fact, it is at 18 cycles, according to Békésy, that we pass
suddenly from the perception of a succession of discrete impulses
to a single fused sensation which possesses a truly tonal
character. Hence, 18 cycles may be called the fusion frequency
of pitch perception (cf. Brecher). A decided roughness is present
at this frequency, which disappears only gradually with
increasing frequency.

What then, we may ask, is the nature of the sensation at
frequencies below the fusion frequency? In this region, the
observer listening monaurally has the impression that the
alternating pressure gives rise directly to a tactual sensation. That
this is not tactual in the ordinary sense, Békésy demonstrates by
showing that, under equal binaural stimulation, a tone of 10
cycles gives rise to an auditory sensation which is localized in
the middle of the head, and which can be shifted from side to
side by altering the intensity of the sound in one ear (see
Chapter 6). This cannot be done when tactual pressures are applied
to the external ear. Nevertheless, tones of very low frequency
do elicit tactual sensations at high intensities (cf. Fig. 19).

Wever and Bray (9) produced tones with a pistonphone, and
investigated the phenomenal correlates of low frequencies.
Their observers described four phenomena (noise, intermittence,
thrusting effect, and tone) as appearing successively when the
frequency was increased from 5 to 60 cycles. A suggestion of
tone first appeared at about 20 cycles, although it was
accompanied by a noisy flutter. At 25 cycles the tonal component
was definite for most observers.

In all this work on tonal thresholds at low frequencies it has
not yet been possible to evaluate the role of the aural harmonics
(see Chapter 7) in the production of the sensation of tone. At
the high intensities needed to stimulate the ear at frequencies
below 50 cycles, there are undoubtedly harmonics of great
intensity introduced by the nonlinearity and asymmetry of the ear
itself. The problem can therefore be phrased: Does the sensation
of tone arise when one of the harmonics reaches a certain
value of frequency, or must the fundamental itself achieve a
certain rate?





MINIMUM AUDIBLE FIELDS 

The intensity of a sound wave in free space which will just
elicit a sensation of hearing in an observer who enters the space
is known as the minimum audible field. To determine it, the
intensity of the sound is first measured without the observer in
the field, and then the observer enters the field and listens to the
sound. As for other threshold measurements, the intensity
must be determined at a value above threshold and then reduced
until the threshold is reached. The intensity at the ear of the
observer is obviously not the same as the intensity of the field
at that point before the listener entered the field, because an
object as large as the human head distorts the sound stream.
Nor is the intensity at the ear the same when the head is oriented
differently with respect to the source of sound, i.e., when the
source is at different azimuths. The effect of the observer's
head in the sound field is analogous to that of a ball held in a
stream of water: the pressure on the ball is not the same at
every point. Furthermore, the variation of sound pressure at
the ear with orientation of the head depends upon the frequency
of the tone (see p. 168).

In order to avoid inconstancies due to different orientations
of the observer, it is advisable to adopt a standard procedure for
presenting and listening to tones in free space. It has been
proposed (Fletcher and Munson, 1) that the standard manner
of listening shall require the observer to face a source, which
shall be small, and to listen with both ears at a position such
that the distance from the source to a line joining the two ears
is 1 meter. Actually, the sound wave at 1 meter from a small
source is a spherical wave, but the difference between the effect
of such a spherical wave and a plane wave on an object the size
of the head is negligible for the present purpose. Therefore,
the data obtained under the standard manner of listening can be
accepted as valid for plane progressive sound waves in air.

Sivian and White determined the minimum audible field
under the standard method of listening for thirteen observers,
over the range from 60 to 15,000 cycles. In order to obtain a
progressive wave in a closed room it is, of course, necessary to
prevent reflected waves from reaching the observer. Only the
wave directly from the source may reach his ears. Thus, Sivian
and White placed their source, a loud-speaking receiver, in a
highly absorbing acoustic structure called a “sound stage,” and
seated the observer 1 meter in front of the source. The intensity
of the sound field prior to placing the observer in it had been
measured (by means of a condenser transmitter whose field
calibration was obtained with a Rayleigh disk). The observer
was provided with a push button which lighted a small lamp,
and he was instructed to press the button whenever and as long
as he heard the tone. The experimenter then proceeded to
attenuate the tone until threshold was reached. Some of the
results of this experiment are shown in Fig. 16, together with
results obtained by other experimenters.



Fig. 16. The minimum audible fields as determined by various
experimenters. (After Sivian and White.)


The other curves in Fig. 16 represent data of the same type
(minimum audible field), but they were not obtained by listening
in the standard manner and must be appraised accordingly.
The lowest curve is for data obtained by Wien. These thresholds
are apparently lower than those obtained by later workers,
but the reason probably lies in the manner of presentation of the
stimuli. The observer’s head was situated behind a sheet-iron
screen and his ear protruded through a hole in the screen. The
source was then placed 30 cm from the hole in the screen, so
that the sound came to the observer directly from the side
rather than from the front. Sivian and White report that one
of their best observers, when listening to tones coming from
the side (90° azimuth), gave thresholds which agreed closely
with Wien’s. It appears, therefore, that Wien’s data present
the thresholds which might be obtained with an observer of
very good hearing, listening under the most favorable azimuth
conditions.

The data obtained by Meyer are the same as those recorded
in Fig. 14. Plainly, at high frequencies his values resemble
measurements by the minimum audible field more closely than
by the minimum audible pressures, even though he reported no
difference between the two types of measurement.

Many other experimenters (reviewed by Sivian and White)
have made measurements of minimum audible fields, although
not under the standard conditions of listening. In general their
results show less agreement than those obtained for minimum
audible pressures (Fig. 14). Some of the important differences
between the various sets of data for minimum audible fields
are: (1) age of observers, (2) other individual differences,
(3) number of ears tested, (4) the type of sound field, (5) the
orientation of the observer with respect to the sound field. The
last two factors are probably the most important. They account,
most likely, for the fact that the variability is greater
among field than among pressure measurements.

Even under the most favorable conditions the measurement
of thresholds for hearing is beset by factors causing variability,
so that the threshold necessarily emerges as a statistical concept.
The inherent variability of the observer himself can be easily
demonstrated by presenting him with a steady tone at an intensity
very near threshold and requiring him to press a button
during all the time that he hears the tone. Almost without
exception observers press the button intermittently. The threshold
should be the most probable value of the stimulus which
will just excite a sensation of hearing, and for that reason it is
always necessary to take the average of a series of values which,
at different times, have proved just sufficient to elicit a response
from the observer. The classical methods of psychophysics
(Guilford) have been developed as an aid in determining that
value of the stimulus which it is best to call the threshold.
Several variations of these methods have been adopted by the
experimenters whose work has been discussed.
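
The averaging described above can be sketched in a few lines of Python. This is only an illustration of taking the mean of a series of just-audible values, with the standard deviation as a measure of the observer's variability. The readings are hypothetical, and the sketch does not reproduce any particular psychophysical method.

```python
from statistics import mean, stdev


def estimate_threshold(just_audible_levels_db):
    """Return (mean, standard deviation) of a series of just-audible levels in db."""
    return mean(just_audible_levels_db), stdev(just_audible_levels_db)


# Hypothetical levels (db) at which a tone was just heard on five occasions.
levels = [4.0, 6.0, 5.0, 7.0, 3.0]
threshold, spread = estimate_threshold(levels)
print(threshold, spread)
```

The spread matters as much as the mean: two sets of data can be said to differ only if the difference between their means is large compared with this variability.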

NATURE OF PRESSURE AND 
FIELD MEASUREMENTS 

In order to determine the most representative set of values
for the threshold of hearing in terms of both minimum audible
pressure and minimum audible field, Sivian and White
reviewed most of the previous work done on the subject, and, by
considering the nature of each experiment, they were able to
arrive at the values represented by the curves in Fig. 17. Curve 1
represents the values for the threshold of hearing when
measurement is in terms of sound pressure at the eardrum. The curve
is a weighted average of the results obtained by several
experimenters. In weighting the various results, the number of ears
investigated and the experimental procedure employed were
carefully considered. The ages of some of the observers,
however, were not known, so that we cannot be perfectly sure that
curve 1 represents the hearing of young people with no
abnormalities. Nevertheless, considering the number of separate
studies on which curve 1 is based, and the good agreement
among them, the composite curve for minimum audible
pressures is probably reliable and valid.

Fig. 17. Minimum audible pressure (M.A.P.) and minimum audible field
(M.A.F.). In drawing these curves Sivian and White give careful
consideration to the results of their own and other experimenters' data.
The arrow indicates the standard reference pressure. (After Sivian and
White.)

Curve 2 represents the threshold intensity of a sound field
when the observer faces the source and listens with both ears.
This curve is based almost wholly on the results obtained by
Sivian and White, and these results apply definitely to young
people with good hearing. In adopting the particular form
of the curve shown in Fig. 17, some account was taken of the
several other determinations available. The peculiar wavy
appearance of curve 2 is due to diffraction of the sound wave
around the head of the observer as he faces the source of sound.
The pattern of diffraction is not the same at all frequencies.
Moreover, the form of curve 2 would be different if some other
azimuth had been chosen, such as placing the source at the side
instead of in front of the observer.

It is interesting to inquire, therefore, what the form of the
threshold curve would be if the source were moved continuously
around the head of an observer with equally good hearing
in both ears. What would be the threshold curve for the case
in which the sound is able to reach the observer directly from
all sides? Knowing the curve for the case in which the
observer faces the source, and having at hand adequate data on the
form of the ‘sound shadow’ cast by the head at different
frequencies, we can calculate the minimum audible field for
random incidence of the sound. The result of such a calculation
is depicted by curve 3 in Fig. 17. At all except the low
frequencies, where sound shadows are very slight, the thresholds
for random incidence are lower than those obtained with the
observer facing the source. This result is to be expected,
because, with the sound coming from all directions, the minimum
depends on the optimum direction for the orientation of the
head relative to the source of sound at every frequency.

In Fig. 17, curve 1 is for monaural listening. Curve 2 is
indicated as representing binaural listening. Before comparing
the two sets of data it is well to inquire as to the effect of binaural
listening on the auditory threshold. In the data of Sivian and
White, the monaural thresholds were hardly distinguishable
from the binaural thresholds. That is to say, the rather large
variability of threshold data prevented any reliable difference
from being established between monaural and binaural
thresholds. Other experimenters, however, have shown that the
threshold for binaural listening is lower than for monaural.
In fact, Hughes has demonstrated that, in order to reach
threshold, the total energy required when the tone is led to the two
ears is equal to the energy required in one ear, regardless of the
actual division of energy between the two ears. Any fraction
of the energy needed to produce a sensation of hearing in one
ear can be diverted to the opposite ear and a sensation still
results. Thus a subliminal stimulus in the right ear lowers the
threshold for the left, and this occurs even when the tones in
the two ears are of different frequency. Holway and Upton
likewise showed that throughout the audible range of
frequencies the binaural threshold is lower than the monaural threshold
for either ear. Each of thirty subjects tested with a tone of 800
cycles showed this effect, and for a majority of these persons
the difference between the binaural and monaural thresholds
was approximately 6 db. Hence it appears that the nervous
excitations from the two ears summate when the stimuli are
[...] (see p. 115), and consequently the threshold for listening
with both ears in an open sound field should be lower than that
obtained with a receiver on one ear. This fact may account for some
but not all the difference between curves 1 and 2 in Fig. 17.
Let us consider, therefore, some additional items affecting the
measurement of thresholds.
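
The two binaural findings quoted above can be put into numbers. The sketch below, in Python, assumes the usual decibel conventions (10 times the logarithm of an energy ratio, 20 times the logarithm of a pressure ratio). The function names are illustrative, and an equal division of energy between the ears is only one of the divisions Hughes's rule allows.

```python
import math


def db_from_energy_ratio(ratio):
    """Level difference in db for a given ratio of energies."""
    return 10 * math.log10(ratio)


def pressure_ratio_from_db(level_db):
    """Ratio of pressures corresponding to a level difference in db."""
    return 10 ** (level_db / 20)


# Hughes: the total energy at threshold is the same for one ear or two.
# With an equal split, each ear receives half the monaural energy.
per_ear_change_db = db_from_energy_ratio(0.5)

# Holway and Upton: binaural threshold about 6 db below the monaural.
pressure_ratio = pressure_ratio_from_db(-6.0)

print(per_ear_change_db, pressure_ratio)
```

On these conventions an equal split puts each ear about 3 db below its monaural threshold, and a 6-db difference corresponds to roughly half the sound pressure.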





COMPARISON OF PRESSURE AND 
FIELD MEASUREMENTS 

The values for minimum audible fields lie from 10 to 20 db
below the values for minimum audible pressures when the latter
are measured at the eardrum. In other words, we are faced
with the apparent contradiction that, when an observer is
listening to a tone which he is just able to hear, the intensity of the
sound field outside his ear is less than the intensity at his
eardrum. The resolution of this difficulty must, of course, lie in
the discovery of factors which tend to prejudice the
measurement of sound intensities in one or the other, or both, of the
two cases. As yet, however, we can only suggest what some
of these factors might be; we cannot prove that they account for
all the discrepancy. At high frequencies we might well expect
the two types of measurement to differ because of sound
shadows (diffraction) caused by the head and the pinnae, and
because of anomalous wave motion in the auditory canal, even
though the physical measurements were perfect in both cases.
For frequencies in the middle range, it can be shown that the
sound pressure at the eardrum is greater than that in the
external sound field, because of the resonance of the external
ear-canal. The resonant frequency of this small chamber is very
near 3000 cycles, and consequently, near this frequency, the
pressure at the drum may be as much as 3 times greater than the
pressure outside the meatus (Békésy, 11). At low frequencies,
however, where measurements of pressure at the eardrum are
most reliable, where resonance effects are nil, where diffraction
is slight, and where, also, the age of the observer is of less
moment, it is difficult to account for the wide differences between
curves 1 and 2 in Fig. 17.
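
The resonance figures quoted above can be checked roughly. The sketch below, in Python, treats the ear-canal as a tube closed at one end, so that it resonates when its length is a quarter of the wave length. That idealization, and the value used for the velocity of sound, are assumptions made here and are not stated in the text. The phrase "3 times greater" is read as a pressure ratio of 3.

```python
import math

SPEED_OF_SOUND_CM_PER_S = 34300.0  # assumed value for air at about 20 C


def quarter_wave_length_cm(resonant_frequency):
    """Length of a tube closed at one end that resonates at the given frequency."""
    return SPEED_OF_SOUND_CM_PER_S / (4.0 * resonant_frequency)


def pressure_ratio_to_db(ratio):
    """Level difference in db for a given ratio of sound pressures."""
    return 20.0 * math.log10(ratio)


canal_length_cm = quarter_wave_length_cm(3000.0)
gain_db = pressure_ratio_to_db(3.0)
print(canal_length_cm, gain_db)
```

On these assumptions a resonance near 3000 cycles implies a canal a little under 3 cm long, and a threefold pressure at the drum is a gain of between 9 and 10 db. This is of the right order to contribute to the difference between field and pressure thresholds in the middle range, but not at low frequencies.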

It is conceivable that the effect called physiological noise,
which is associated with the tight fit of the sound receiver on
the ear, may mask the tones (see Chapter 8) when minimum
audible pressures are being measured. On account of pulse
actions, breathing, etc., mechanical vibrations are set up in the
enclosed space whenever a receiver is pressed to the ear (Békésy,
24). This is the same effect that delights children when they
hold a sea shell to their ear and listen to the “ocean’s roar.”
The sensation is of a low-frequency noise which tends to raise
the thresholds, particularly for low frequencies.

Other factors, whose effects are difficult to evaluate, are the
changes in static pressure and the temperature in the ear-canal
when a receiver is on the ear. These factors might affect the
acoustic impedance of the eardrum and also cause fatigue and
annoyance. A tightly fitting receiver might also introduce
variable amounts of interference from bone conduction (see
p. 291).

Another factor which might play an important part in
differentiating open-field hearing from hearing with the receiver
fitted tightly to the ear is the change in tension of the muscles of
the middle ear for sounds of different intensity. Loud sounds
cause the tensor tympani to contract and exert a tension on the
eardrum, which in turn impedes the transmission of acoustic
vibrations from the external to the inner ear (see p. 266). Now,
pressure measurements are necessarily made at an intensity well
above threshold (about 60 db), and the sound is then
attenuated until threshold is reached. Field measurements, on the
other hand, can be made at levels much nearer threshold. It
may be that in the case of pressure measurements, as the sound
is attenuated the tension on the drum is relaxed, so that the
acoustic impedance of the ear is changed, especially at low
frequencies. This change of impedance would mean that the
energy consumed in driving the eardrum is not a linear function
of the energy emitted by the receiver, and consequently the true
threshold of hearing cannot be measured in terms of the amount
of attenuation at the receiver.

A final evaluation of these several factors which influence
the measurement of minimum audible fields and minimum
audible pressures is not possible at present.

RELATION OF THE AUDITORY THRESHOLD 
TO THE REFERENCE INTENSITY 

It is interesting to compare the auditory thresholds to the
standard threshold (reference intensity), in terms of which it
is customary to express acoustic intensities (see Glossary). The
reference intensity is, for a plane progressive sound wave,
equivalent to a pressure of 73.8 db below 1 dyne per square centimeter.
This value is indicated by the arrow on the ordinate scale of
Fig. 17. The minimum audible pressure in the region where
the ear is most sensitive (2000 to 3000 cycles) is about 5 db above
the reference intensity. On the other hand, the minimum
audible field is almost 10 db below the standard level of reference in
the same region.
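
The decibel values above can be turned back into pressures. The sketch below, in Python, assumes only that the levels are 20 times the logarithm of the pressure ratio relative to 1 dyne per square centimeter. The offsets of 5 db above and 10 db below the reference are the approximate figures given in the text, and the names are illustrative.

```python
REFERENCE_DB_RE_1_DYNE = -73.8  # reference pressure, db re 1 dyne per sq cm


def db_to_pressure(level_db_re_1_dyne):
    """Pressure in dynes per square centimeter for a level in db re 1 dyne per sq cm."""
    return 10 ** (level_db_re_1_dyne / 20)


reference_pressure = db_to_pressure(REFERENCE_DB_RE_1_DYNE)
map_pressure = db_to_pressure(REFERENCE_DB_RE_1_DYNE + 5)   # about 5 db above
maf_pressure = db_to_pressure(REFERENCE_DB_RE_1_DYNE - 10)  # almost 10 db below

print(reference_pressure, map_pressure, maf_pressure)
```

The reference pressure comes out at about 0.0002 dyne per square centimeter, the minimum audible pressure near 0.00036, and the minimum audible field near 0.00006.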

THE AMPLITUDE OF MOVEMENT OF THE 
EARDRUM AT THRESHOLD 

The absolute displacement of the eardrum at the threshold
of audibility is extremely small, much too small to be
measured directly, for, throughout the region where the ear is most
sensitive, the drum moves through a distance equal only to
about one thousandth of the wave length of light. Hence, the
measurement of amplitude must be made at a frequency or
intensity at which displacement is measurable, and then the
amplitude at threshold may be determined by extrapolation.

A recent method (Wilska) made use of a device, similar to
an electrodynamic loud speaker, to drive a light wooden shaft,
8 cm long, one of whose ends was cemented to the eardrum. The
other end was fastened to the moving coil of the
electromechanical transducer, which was mounted rigidly on the side of the
subject's head. Then, with a microscope, the amplitude of
motion of the shaft was determined as a function of the frequency
and magnitude of the current in the moving coil. This could
be done only at low frequencies. However, it was assumed
that the amplitude varies (1) inversely as the square of the
frequency and (2) directly as the magnitude of the current in
the moving coil. Then, by measuring the current at the
threshold of hearing for different frequencies, the experimenter could
calculate the absolute movement of the eardrum at threshold.
The results are shown by the circles in Fig. 18.
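
The extrapolation rests on the two stated assumptions, which together give an amplitude proportional to the current divided by the square of the frequency. A minimal sketch in Python follows. The calibration point and the threshold current are hypothetical numbers chosen only to show the arithmetic, and are not Wilska's data.

```python
def calibrate(amplitude, current, frequency):
    """Constant k in amplitude = k * current / frequency**2, from one measured point."""
    return amplitude * frequency ** 2 / current


def eardrum_amplitude(k, current, frequency):
    """Amplitude predicted under the two assumptions named in the text."""
    return k * current / frequency ** 2


# Hypothetical calibration: 1e-4 cm observed at 50 cycles with 10 units of current.
k = calibrate(1e-4, 10.0, 50.0)

# Hypothetical threshold current of 0.002 units at 1000 cycles.
amplitude_at_threshold = eardrum_amplitude(k, 0.002, 1000.0)
print(amplitude_at_threshold)
```

The result is only as good as the two assumptions: if the moving system departs from the inverse-square law at higher frequencies, the extrapolated amplitudes are in error by the same factor.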

For purposes of comparison, the solid curve in Fig. 18 gives
the amplitude of motion of the air particles in a plane sound
wave whose root-mean-square pressure equals the threshold
pressure shown in Fig. 17. The agreement between the two
types of measurement is satisfactory and confirms the fact that
the ear is sensitive to extremely minute movements. Even
slightly less movement may occur at the oval window and
within the cochlea for threshold stimulation. Consequently, it
appears that near 3000 cycles we are able to detect a
displacement of the basilar membrane equal to about 10⁻¹⁰ cm. This
distance is less than 1 per cent of the diameter of a hydrogen
molecule!

Fig. 18. The circles show the amplitude of vibration of the eardrum at
threshold, as determined by Wilska. The curve represents the calculated
amplitude of the air molecules in a sound wave at threshold pressure. Where
the ear is most sensitive the amplitude of vibration of the eardrum is less than
the diameter of a hydrogen molecule.
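
The comparison curve of Fig. 18 can be approximated from the standard relation for a plane progressive wave, in which the particle displacement equals the pressure divided by the product of 2π, the frequency, and the characteristic impedance of air. The sketch below, in Python, is a rough check under stated assumptions: the impedance value is assumed, and the threshold pressure near 3000 cycles is taken as 5 db above the reference pressure, which is the figure the text gives for minimum audible pressure.

```python
import math

RHO_C = 41.5  # characteristic impedance of air in g/(cm^2 s), assumed value


def particle_displacement_cm(rms_pressure, frequency):
    """Root-mean-square particle displacement in a plane wave, in cm.

    rms_pressure is in dynes per square centimeter, frequency in cycles per second.
    """
    return rms_pressure / (2 * math.pi * frequency * RHO_C)


# Threshold pressure assumed 5 db above the reference (73.8 db below 1 dyne per sq cm).
threshold_pressure_rms = 10 ** ((-73.8 + 5) / 20)
displacement = particle_displacement_cm(threshold_pressure_rms, 3000.0)
print(displacement)
```

The displacement comes out at a few times 10⁻¹⁰ cm, which is of the order of magnitude discussed in the text and well below the diameter of a hydrogen molecule.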

THEORETICAL LIMITS OF AUDITORY 
SENSITIVITY 

In view of the very small amount of acoustic energy needed
to excite a sensation of hearing in a good ear, the question arises
as to whether the sensitivity of the ear is limited by its
construction and its physiological efficiency or whether the limit is
imposed by the nature of the air as a transmitting medium for
sound. We know, from experiments on Brownian movement,
that the individual molecules of the air are constantly in random
agitation with a violence dependent upon the temperature of
the air. On any surface exposed to the air, therefore, there are
tiny periodic fluctuations in pressure, caused by the irregular
distribution of thermal velocities among the air molecules. The
result is a spectrum of thermal acoustic noise in which all
frequencies are represented. The question then is: Are the
frequencies to which the ear is most sensitive present in the thermal
noise at an intensity great enough to be heard?

An approximation to the solution of this problem was
worked out by Sivian and White along lines analogous to the
method used in determining the electromotive force produced
by the thermal agitation of electric charges in conductors. The
random motion of the charges on the atoms composing an
electric conductor and the random motion of air molecules produce
analogous effects. By limiting the consideration to pressures
generated within a limited band of frequencies, it is possible
to calculate these pressures. Thus, calculation shows that
between the frequencies 1000 and 6000 cycles the root-mean-square
pressure due to thermal agitation is about 86 db below 1 dyne
per square centimeter. Throughout that region the minimum
audible field averages about 76 db, but in cases of persons with
particularly excellent hearing it may average about 85 db, below
1 dyne per square centimeter. The calculations of the pressure
due to the thermal noise in air are admittedly crude, but they
serve, nevertheless, to demonstrate that, in the region of maximal
sensitivity, the minimum audible field for a good ear has a
pressure of the same order of magnitude as the thermal acoustic
pressure at ordinary temperatures. For exceptionally acute
ears, therefore, a further increase in sensitivity would be useless
in the face of the normal noise continuously present in the air.
This fact makes it highly unlikely that there should be animals
having appreciably greater auditory sensitivity than man in the
region between 1000 and 6000 cycles, for they too would be
limited by thermal noise.
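
The text does not give the formula used. As an illustration only, the sketch below in Python assumes an expression of the kind commonly quoted for the thermal pressure on a small rigid surface in air, in which the mean-square pressure in a band equals 8πρkT/(3c) multiplied by the difference of the cubes of the band limits. That expression, and the values taken for the density of air, the velocity of sound, and the temperature, are assumptions and are not drawn from the passage.

```python
import math

BOLTZMANN = 1.380649e-16  # erg per kelvin
AIR_DENSITY = 1.204e-3    # g per cubic centimeter, assumed (about 20 C)
SOUND_SPEED = 3.43e4      # cm per second, assumed
TEMPERATURE = 293.15      # kelvin, assumed


def thermal_rms_pressure(f_low, f_high):
    """Assumed root-mean-square thermal pressure (dynes per sq cm) in a band."""
    mean_square = (8 * math.pi * AIR_DENSITY * BOLTZMANN * TEMPERATURE
                   / (3 * SOUND_SPEED)) * (f_high ** 3 - f_low ** 3)
    return math.sqrt(mean_square)


pressure = thermal_rms_pressure(1000.0, 6000.0)
level_db = 20 * math.log10(pressure)  # db re 1 dyne per square centimeter
print(pressure, level_db)
```

Under these assumptions the band from 1000 to 6000 cycles gives about 0.00005 dyne per square centimeter, or close to 86 db below 1 dyne per square centimeter, in agreement with the figure quoted above.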

A problem analogous to that of thermal noise has been
raised in regard to differential visual acuity. Barnes and
Czerny conclude that there is evidence to show that the human
eye, in that region of the visual spectrum to which it is most
sensitive, has a differential sensitivity of the same order of
magnitude as the fluctuations inherent in a ‘steady’ light due to the
‘shot effect’ in photon emission. Nature has apparently
produced sensory receptors in man whose effectiveness is limited
only by the quantal nature of the phenomena they are designed
to discriminate.

THE THRESHOLD OF FEELING 

As the intensity of an audible sound is increased, a point is
reached at which the listener experiences a nonauditory tactual
sensation. This is usually described as ‘feeling,’ but its nature
varies considerably with frequency and somewhat with
observers. At lower frequencies a gentle but definite vibration
is experienced which is quite distinct and superimposed on the
sound. In some cases, however, “dizziness” is described,
suggesting excitation of the semicircular canals. At higher
frequencies the sensation is likely to be one of sharp pain.

A determination of this threshold of feeling, by allowing
the observer to increase the intensity until the extra-auditory
sensation appeared, gave the results shown by the circles in Fig.
19 (Wegel, 2). This threshold corresponds closely to the
loudness level of 120 db (see p. 124), and may be taken as defining
the upper limit of hearing. The area, then, between the
threshold of feeling and the threshold of hearing, shown by
the lower curve in Fig. 19, is known as the auditory area. It
delineates the audible range of frequencies and intensities when
measured in terms of sound pressure at the eardrum.

The section of the threshold curve for hearing lying below 
50 cycles is due to Bekcsy (22), who reports auditory phenomena 
) at astonishingly low frequencies In addition to the purely 
auditory sensations, which exhibit the usual phenomena of 



THE THRESHOLD OF FEELtNG 


59 


localization (see p. 46), Bekesy describes three other types of
sensation, whose thresholds are plotted in Fig. 19. When, at
10 cycles, the intensity in the two ears is increased about 40 db
above the threshold of hearing, one experiences a tactual
sensation which is definitely localized at the ears and cannot be
shifted by unequal intensities in the two ears. This threshold
is represented by the dotted curve in Fig. 19. At frequencies
below 1 cycle, the tactual sensation appears before the auditory.
For extremely slow frequencies, such as 1 cycle in 30 sec, the
tactual threshold is reached at about the same value of pressure.
Furthermore, this tactual sensation, although it resembles
phenomenally a sensation of pressure on the tip of the finger, shows
extremely slow adaptation and persists for several minutes when
a constant pressure of about 6000 dynes per square centimeter
is applied.

Fig. 19. The auditory area, which lies between the threshold of hearing and
the threshold of feeling. The threshold of hearing represents minimum
audible pressure. Wegel (2) determined the values for the threshold of
feeling. The vertical lines represent the scatter of his observations. The other
curves were determined by Bekesy (22). (Abscissa: frequency.)
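
As a rough aid to reading the pressure quoted above, the following sketch converts a sound pressure into decibels relative to a chosen reference. The function name and both reference pressures are assumptions made for illustration; the passage itself does not state which zero level applies to the figure of 6000 dynes per square centimeter.

```python
import math


def pressure_to_db(pressure, reference):
    """Level in decibels of a sound pressure relative to a reference pressure.

    Both arguments must be in the same unit (here dynes per square centimeter).
    """
    if pressure <= 0 or reference <= 0:
        raise ValueError("pressures must be positive")
    # Pressure ratios are converted with 20 * log10, since energy
    # varies as the square of pressure.
    return 20.0 * math.log10(pressure / reference)


# Constant pressure mentioned in the text.
constant_pressure = 6000.0  # dynes per square centimeter

# Assumed reference of 1 dyne per square centimeter (illustrative choice).
level_re_1_dyne = pressure_to_db(constant_pressure, 1.0)

# Assumed reference of 0.0002 dyne per square centimeter (illustrative choice).
level_re_0002_dyne = pressure_to_db(constant_pressure, 0.0002)

print(round(level_re_1_dyne, 1), round(level_re_0002_dyne, 1))
```

The two results differ only by the constant offset between the two reference levels, which is why the zero of a decibel scale must always be stated.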

Stimulation at still higher intensities, at frequencies below 
10 cycles, arouses a definite pricking sensation which synchro- 
nizes with the maximum of the pressure-wave and which ap- 
pears to be localized much deeper in the ear than the tactual 
sensation. At frequencies above 20 cycles, the pricking passes
over into a sensation of tickle and completely masks the tactual
sensation. Undoubtedly, this sensation of tickle corresponds
to what was reported by Wegel (2) as “feeling.” It shows very
little dependence upon frequency or wave form, but arises at
any frequency when a sufficient amplitude is reached. Further
increase in intensity changes the sensation from a tickle to an
itch, and the itching may persist for several minutes after the
tone has ceased. The persistence of the tickle, however, is only
about 0.02 sec. Finally, Bekesy reports that long-continued
stimulation at intensities above the threshold for tickle produces
a painful burning sensation which resembles the burning
produced when the skin is rubbed severely. When this burning
occurs, the experiment must be discontinued.¹

AUDIOMETRY 

A reliable method for making accurate measurements of the
threshold of hearing for tones of various frequencies is of great
value to otologists, clinicians, school teachers, and many other
persons. Consequently, the more picturesque rule-of-thumb
methods for testing hearing, such as whispering to the patient
or holding a watch at various distances from his ear, have given
way to the audiometer. The commercial audiometer is designed
to determine the acuity, and, to some extent, the quality
of hearing. It consists of a vacuum-tube oscillator equipped
to produce tones of several fixed frequencies at definitely
measurable intensities. These tones are generated at the ear of the
listener by a receiver which has previously been calibrated in
terms of the hearing of a ‘normal’ ear. Then, by measuring
the amount by which the intensity of a given tone must be
increased above the ‘normal’ intensity in order for the patient to
hear the tone, the investigator obtains a measure of the
impairment of hearing for that tone. The test tones usually have
frequencies spaced an octave apart throughout the audible
range, and the ‘normal’ intensity at each frequency is
determined from measurements on a large group of young people.
The standard procedure for testing hearing requires that the
patient hold the receiver snugly to his ear and press a signaling
button during all the time he hears a tone. The button lights
a lamp. Then, at each frequency in turn, the intensity is
adjusted until it reaches the lowest value for which the patient is
able consistently to signal that he hears the tone. This value
of the intensity is taken as the threshold, and the relation of
this value to the ‘normal’ intensity determines the patient’s
relative acuity. In order to facilitate the determination of this
relation, the dial by which the intensity is controlled is calibrated
directly in decibels (sometimes called “sensation units”), so
that the operator obtains a direct reading of the number of
decibels that the patient’s threshold is above or below normal.



Fig. 20. An audiogram for a person suffering from high-tone deafness.
The circles are for the right ear, the crosses for the left. The dotted curve
represents a hearing loss equal to 100 per cent at all frequencies. The useful
auditory area is enclosed between this curve and the zero ordinate. The zero
ordinate represents normal hearing.

THE AUDIOGRAM 

Results of audiometric measurements are usually plotted on 
what is called an audiogram. A sample audiogram, obtained 
with a commercial audiometer, is shown in Fig. 20. The abscissa
represents the frequency of the tones presented, and the
ordinate measures, in decibels, the amount by which hearing
is below normal. The dotted curve represents, approximately,
what is known as the threshold of feeling. When a tone is
made sufficiently intense, the observer experiences a tactual
(“feeling”) sensation, and the distance on the audiogram from
the line representing normal hearing to the dotted curve is a
measure of the intensity above threshold at which feeling
occurs. The area between the normal line and the dotted curve
is commonly referred to as the auditory area. Another way of
plotting the auditory area is shown in Fig. 19.

Clearly, if a person is unable to hear a particular tone, even
when the intensity is made so great that his threshold of feeling
is reached, his hearing loss at that frequency, for all practical
purposes, is 100 per cent. On the other hand, if he hears the
tone when its intensity is raised to a value midway between the
normal threshold and the threshold of feeling, his hearing loss
is only partial. This procedure for determining hearing loss
suggests a method for designating the state of a patient's
hearing. Thus if, for a tone on the audiogram in Fig. 20, the
intensity which the listener is just able to hear lies halfway
between the normal threshold and the threshold of feeling, we
say the hearing loss at that frequency is 50 per cent. In other
words, one hundred times the ratio of the distance of the
threshold point from the normal line to the distance from the
normal line to the threshold of feeling is taken as the
percentage of hearing loss at that frequency. The figures at the
bottom of Fig. 20 give the percentage of loss for the two ears
whose audiograms are plotted.
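
The rule just stated can be put in a few lines. The sketch below is only an illustration: the function name is hypothetical, the value of 120 db for the threshold of feeling is an assumed round number rather than a reading from Fig. 20, and the clamping of results to the range 0 to 100 is an added convention.

```python
def percent_hearing_loss(threshold_db, feeling_db):
    """Percentage hearing loss at one frequency.

    threshold_db: the patient's threshold, in db above the normal threshold
                  (the audiometer dial reading).
    feeling_db:   the threshold of feeling at the same frequency, in db
                  above the normal threshold.
    """
    if feeling_db <= 0:
        raise ValueError("threshold of feeling must lie above the normal threshold")
    fraction = threshold_db / feeling_db
    # Assumed convention: better-than-normal hearing counts as 0 per cent,
    # and a threshold at or beyond the threshold of feeling as 100 per cent.
    fraction = min(max(fraction, 0.0), 1.0)
    return 100.0 * fraction


# Assumed example: feeling occurs 120 db above normal threshold, and the
# patient first hears the tone 60 db above normal threshold.
print(percent_hearing_loss(60.0, 120.0))
```

Because both distances are measured in decibels, the percentage is a ratio of logarithmic quantities and not of energies.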

It must be borne in mind that this manner of designating
hearing loss is based upon an arbitrary procedure, and,
although the method is convenient and standard, care is needed
in its interpretation. The ordinate of the audiogram is a
logarithmic scale, since it is measured in decibels. Hence, to
say that a person has a 50-per-cent hearing loss does not mean
that the energy of the stimulus must be raised to 50 per cent of
its value at the threshold of feeling in order for him to hear
the tone. (Actually, the threshold energy for hearing, of a person
with a 50-per-cent hearing-loss, is equal to the square root
of the energy at the threshold of feeling, both energies being
expressed relative to the energy at the normal threshold.)
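
The square-root relation follows from the logarithmic nature of the decibel, and a short numerical check may make it concrete. The sketch assumes, purely for illustration, that the threshold of feeling lies 120 db above the normal threshold, and it expresses every energy as a multiple of the energy at the normal threshold; the names are hypothetical.

```python
import math


def db_to_energy_ratio(level_db):
    """Energy, as a multiple of the energy at the normal threshold,
    of a tone lying level_db decibels above the normal threshold."""
    return 10.0 ** (level_db / 10.0)


feeling_db = 120.0                  # assumed threshold of feeling
half_loss_db = 0.5 * feeling_db     # a 50-per-cent hearing loss

energy_at_feeling = db_to_energy_ratio(feeling_db)
energy_at_patient_threshold = db_to_energy_ratio(half_loss_db)

# Halving the number of decibels takes the square root of the energy ratio.
assert math.isclose(energy_at_patient_threshold, math.sqrt(energy_at_feeling))

# The patient's threshold energy as a fraction of the energy at feeling.
fraction_of_feeling_energy = energy_at_patient_threshold / energy_at_feeling
print(fraction_of_feeling_energy)
```

Under these assumptions the patient's threshold energy is a minute fraction of the energy at the threshold of feeling, far from the one-half that the phrase "50 per cent" might suggest.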

It is commonly supposed that the logarithmic scale of sensation
units (decibels) is a true measure of the subjective loudness
of a tone, and that the measurement of hearing loss in sensation
units is essentially a valid subjective measure. Loudness,
however, is not proportional to the logarithm of the stimulus; the
Weber-Fechner law does not hold here (see Chapter 4).
Consequently, a hearing-loss of 50 per cent does not mean that the
loudness of a tone, as judged by a normal observer, must be
increased to 50 per cent of its loudness at the threshold of feeling
in order to be heard by the patient. In fact, recent determinations
of values of subjective loudness demonstrate that a tone
of 1000 cycles which, on the audiogram in Fig. 20, is halfway
between the normal line and the threshold of feeling has a
loudness, for a normal listener, which is only about 1 per cent of
the loudness of the same tone at the threshold of feeling.

Clearly, then, the percentage scale used for measuring 
hearing loss agrees neither with the physical measure of the 
intensity of the stimulus nor with the subjective measure of the 
loudness of a tone. It is, however, a convenient scale, since it 
has all of the advantages of the decibel scale (discussed in 
Chapter 1), and its widespread use by those interested in clinical 
studies warrants its retention.

TONAL LACUNAE 

Tonal lacunae are commonly understood to be isolated regions
of frequencies to which the ear is not sensitive. The sensitive
regions between tonal lacunae are called tonal islands. In
most cases of supposed insensitivity to certain frequencies, it is
found that, by increasing the intensity of the stimulating tone,
a value is found which results in a sensation of hearing. In
other words, tonal lacunae turn out to be regions of relative
rather than absolute insensitivity. The audiogram for the
left ear in Fig. 20 illustrates this point. At 4096 cycles the
hearing-loss, in decibels, in the left ear is greater than at
frequencies an octave removed on either side. If this ear were
being tested with a tone 60 db above threshold, a range of
inaudible frequencies would be found near 4096 cycles. This
tonal lacuna would disappear, however, as soon as the intensity
was increased to 70 db above threshold.
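
The dependence of a tonal lacuna upon the level of the test tone can be sketched as follows. The hearing-loss values are hypothetical numbers chosen only to reproduce the behavior described; they are not readings from Fig. 20, and the function name is likewise an assumption.

```python
def inaudible_frequencies(hearing_loss_db, test_level_db):
    """Frequencies at which a test tone of the given level (in db above the
    normal threshold) falls below the listener's own threshold."""
    return [
        frequency
        for frequency, loss in sorted(hearing_loss_db.items())
        if loss > test_level_db
    ]


# Hypothetical hearing loss, in db, at octave-spaced test frequencies (cycles).
hypothetical_loss_db = {1024: 40, 2048: 55, 4096: 65, 8192: 55}

# At 60 db above normal threshold an apparent lacuna appears at 4096 cycles.
print(inaudible_frequencies(hypothetical_loss_db, 60))

# At 70 db above normal threshold the lacuna is gone.
print(inaudible_frequencies(hypothetical_loss_db, 70))
```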

SENSITIVITY BY BONE CONDUCTION 

The determination of normal thresholds for hearing by
bone-conduction is of great practical importance to the otologist.
Unfortunately, the nature of the process (the application of a
small vibrating diaphragm to the mastoid bone behind the
pinna) precludes the specification of bone-conduction thresholds
in simple energy units, such as are used in hearing by
air-conduction. The amount of energy delivered to the ear by
bone-conduction is difficult to determine. It is, of course,
possible to measure the power delivered to the bone-conduction
receiver, but how much of this power is passed on to the auditory
mechanism depends upon the type of receiver used, and the
manner of its application to the head of the subject. For testing
a person with normal hearing, the external ear-canal must be
stopped off, but the question remains as to how much sound
energy reaches the eardrum of the listener by what is essentially
air-conduction, i.e., transmission through the stopped meatus.

If we assume that the sound heard at threshold reaches the
ear exclusively by bone-conduction, we can then proceed on an
empirical basis to the construction and calibration of a
bone-conduction audiometer. As with the audiometer for determining
sensitivity to tones by air-conduction, the instrument can be
calibrated in terms of the hearing of a large number of normal
observers, and hearing loss in abnormal ears can then be
measured in terms of the amount by which the power delivered to
the bone-conduction receiver must be augmented in order to
excite a sensation of hearing. The bone-conduction audiometer
is, in the hands of the otologist, an important diagnostic aid
for determining causes of deafness (see Chapter 11).



SENSITIVITY TO ELECTRICAL STIMULATION 

The auditory sensitivities discussed thus far have pertained
to some form of mechanical stimulation. It is possible,
however, to elicit a sensation of hearing by delivering electrical
rather than mechanical energy to the ear. In other words,
when an alternating electric current is passed through the head
of an observer, he hears, under proper conditions, a tone whose
pitch depends upon the frequency of the current. This
phenomenon has been called the electrophonic effect (Stevens, 8).

Electrophonic thresholds have been determined by measuring
the minimal power needed, at various frequencies, to arouse
an auditory sensation, with electrodes applied in a standardized
manner. One electrode consisted of a metal plate in contact
with the inside of the observer's forearm. In order to make
contact with the other electrode at the ear, the external meatus
was filled with a salt solution and a bare wire was immersed
about 5 mm in the solution. The two electrodes were then
connected to a beat-frequency oscillator, through an ammeter
and vacuum-tube voltmeter, so that the voltage and current
delivered to the observer could be measured simultaneously.

As is well known, the body presents to an alternating current
a complex impedance which has resistive and capacitive
components. Therefore, in order to measure the power
dissipated, the equivalent resistance and reactance of the portion of
the body in the path of the alternating current must be
determined. This can be done by connecting the two electrodes
to an impedance bridge, and measuring the resistance and the
reactance of the observer. The resistance and the capacitive
reactance decrease with increasing frequency. From the values
obtained from the impedance bridge it is possible to calculate
the power factor of the observer (the ratio of the resistance to
the total impedance) at each frequency, and then to calculate
the actual power consumed when a threshold current is passed
through the body.
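
The calculation described in this paragraph may be sketched as follows. The sketch assumes a series equivalent circuit and effective (r.m.s.) values of voltage and current; the function names and the numerical values of resistance, reactance, and current are hypothetical and serve only to show the arithmetic.

```python
import math


def power_factor(resistance, reactance):
    """Ratio of the resistance to the total impedance (series equivalent)."""
    impedance = math.hypot(resistance, reactance)
    return resistance / impedance


def dissipated_power(voltage_rms, current_rms, factor):
    """Power actually consumed: volts times amperes times power factor."""
    return voltage_rms * current_rms * factor


# Hypothetical bridge readings at one frequency (ohms). The negative sign
# marks the reactance as capacitive.
resistance = 3000.0
reactance = -4000.0

# Hypothetical threshold current (amperes) and the voltage it implies.
current_rms = 0.001
voltage_rms = current_rms * math.hypot(resistance, reactance)

factor = power_factor(resistance, reactance)
power_watts = dissipated_power(voltage_rms, current_rms, factor)

# The same power follows from current squared times resistance.
assert math.isclose(power_watts, current_rms ** 2 * resistance)
print(factor, power_watts)
```

Only the resistive component consumes power, which is why the product of voltage and current alone would overstate the energy delivered to the observer.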

The results of a series of threshold measurements, on observers
whose hearing by air-conduction was tested by means of an
audiometer and found to be normal, are shown in Fig. 21
(Stevens, 8). The lower curve represents the power needed to
reach the threshold of hearing; the upper curve shows the point
at which an electric shock is felt. The fact that an increase
above threshold of about 20 db in the intensity of the stimulus
gives rise to a combined burning, tickling, and prickling
sensation severely limits the loudness which may be achieved through
electrical stimulation. The ‘auditory area’ under electrical
stimulation is much smaller than under conditions of normal
stimulation by sound-waves in air. The electric shock also
imposes limits to the range of frequencies which can be detected,
for, when the frequency of a pure tone is as low as 200 cycles,
most observers experience shock before they hear the tone.
Similarly, at frequencies above 10,000 cycles, the threshold for
shock is usually less than for hearing. Nevertheless, as shown
in Fig. 21, observers sometimes hear tones as low as 125 cycles
and as high as 12,000 cycles. In hearing a tone as low as 125
cycles, there appears a combined auditory, tactual, and pressure
sensation in the ear, which observers report as a “strange
experience.”

Fig. 21. ... alternating current is passed through the head. A sensation of electric shock
is experienced at values given by the upper curve. The usable ‘auditory area’
under electrical stimulation is the area between these curves (Stevens, 8).

If the wave-form of the low tone is extremely non-sinusoidal,
the threshold frequency may be considerably lowered. In fact,
when the wave form approximates that of a series of sharp
impulses, the observers hear a corresponding series of clicks at the
frequency of the impulses, regardless of how low that
frequency may be. This fact probably explains the report of some
experimenters (Gersuni and Volokhov) that thresholds have
been measured for tones as low as 17 cycles. The tones were
apparently not produced by sinusoidal waves.

In the measurement of thresholds for electrical stimulation,
we face the same problem as for bone-conduction; that is, we
cannot determine precisely how much energy is delivered to
the auditory mechanism itself. The ideal measure of the
electrophonic thresholds would be the amount of power dissipated
in the particular mechanism by which electrical energy is
transformed into acoustical energy, but, since there is, at
present, no way of discovering how much of the total energy
dissipated between the two electrodes is consumed in hearing, the
best measure appears to be the total power used when the
electrodes are applied in some standardized manner. Probably
only a small fraction of this total power is consumed in the
auditory mechanism, only enough, in fact, to set up minute
mechanical vibrations in the basilar membrane (see p. 352).

RELATION OF SENSITIVITY TO AGE 

Like most other organs of the body, our ears grow old and
lose some of their effectiveness. The auditory mechanism is
peculiar, however, in that it tends to lose its sensitivity to certain
frequencies, but not to others. Thus, we find that older people
grow increasingly less sensitive to tones of high frequency
(above 1000 cycles) but retain their hearing for low tones
excellently.

An extensive series of audiometer tests on 353 hospital
patients was carried out by Bunch. The results are classified for
age groups by decades, and, as shown in Fig. 22, the outstanding
result is the marked decrease in acuity with advancing age.
The label on each curve indicates the lower limit of the decade.
The number of patients in the successive decades was 68, 70, 78, 
85, and 52. 

Another series of tests was conducted by Montgomery (1) on
200 people, ranging in age from 20 to 60 years. These tests
also indicate decreasing sensitivity with advancing age, though
not so markedly as in Bunch's work. The difference may be
attributed to the fact that the hospital group, examined by
Bunch, included more people whose ailments accentuate the
normal effects of advancing age than would be found in a
healthy group, such as Montgomery investigated. Montgomery
found that the difference between the 20-29 and the 50-59
decades is about 7 db at 2048 cycles and 23 db at 8192 cycles.

Fig. 22. The audiograms of people at different age-levels. The ordinate
records the hearing loss, in decibels, relative to the hearing of people whose
ages lie between 20 and 30 years (zero ordinate). (After Bunch.)





CHAPTER 3


PITCH 

Pitch is one of the psychological aspects, or attributes, of tones.
It is one of the dimensions in terms of which we are able to
distinguish and classify auditory sensations. Thus we commonly
describe high-frequency tones as high-pitched tones, and
to tones of low frequency we attribute a low pitch. This
designation of pitches in most European languages by words meaning
high or low appears to have some basis in phenomenal
experience. When observers are asked to localize the apparent
source of tones produced behind a screen, they are likely to
attribute a higher locus to the high-pitched tones than to the
low, even though the actual source of the tones remains
unchanged (Pratt, 3).

The scale on which we arrange the pitches in order is generally
assumed to have definite lower and upper limits,
corresponding approximately to the frequencies of 20 and 20,000
cycles. The lower limit for auditory sensation is not necessarily
the lower limit for the perception of pitch. Tonal pitch has
been reported to arise quite suddenly at 18 cycles, whereas some
sort of hearing may be possible near 1 cycle (see Chapter 2).
The lower limit for pitch is difficult to determine with precision,
for two reasons. First, there is the problem of distinguishing
between a very low frequency which is heard as a tone and one
which is heard simply as a series of distinguishable pulsations.
Second, the ear itself introduces so much distortion (the
production of aural or ‘subjective’ harmonics) at these low
frequencies that the task of distinguishing between the perception
of the fundamental tone and the hearing of higher harmonics
becomes difficult. For certain low frequencies, the aural
harmonics of an originally pure tone may have a higher sensation
level than the fundamental itself, and the harmonics may
possibly be heard as tones when the fundamental is too faint
to be heard at all (see p. 187). The upper limit of hearing, for
a given individual, can be determined with greater precision,
but in this case the marked differences among persons and the
striking effect of age upon the upper limit prevent any attempt
to fix a ‘normal’ upper limit (see p. 68).

Pitch is a concept determined by the direct response of a
human observer to a sound stimulus. Frequency, on the other
hand, is determined by an observer who uses the instruments
of physical observation and measurement. That is to say,
frequency is measured with the help of instruments; pitch is a
direct perception. It is necessary to stress this difference
between the two concepts, because of the persistent tendency to
confuse pitch with frequency. Physicists have generally used
the two words interchangeably, on the false assumption that
experienced pitch is uniquely determined by the frequency of
the stimulus. Thus Barton, in his excellent treatise on sound,
says, “The pitch of a musical sound depends upon the
period or frequency of the vibrations constituting the sound,
and upon that alone.”

THE RELATION OF PITCH TO INTENSITY

Many investigators, during the last hundred years, have
noted an apparent change in the pitch of a tone with a change
in intensity (Zurmuhl). An instance of this effect (Miles)
appears when observers are required to reproduce vocally the
pitch of a tuning fork (middle C). When the fork is held
close to the ear so that the intensity is increased, the pitch of
the singer's voice is lowered slightly. In other words, the
observers hear the louder tone as lower.

In order to determine quantitatively the effect of intensity
upon the pitch of tones throughout the audible range, a survey
was made of the phenomenon for 11 frequencies ranging from
150 to 12,000 cycles (Stevens, 5). Two tones of slightly
different frequency were presented alternately to an observer. He
was allowed to adjust the intensity of one of the tones until
the pitch of the two tones appeared equal. In other words, the
observer compensated for a difference in frequency by means
of a difference in intensity, and thereby made the two tones
sound equal in respect of pitch.

The results of extensive observations made with one observer
who was exceptionally accurate are shown in Fig. 23. These
curves we may call equal-pitch contours. They show the relation
between frequency and intensity which must be maintained
in order to keep a tone at a constant level of pitch. Hence they
show what happens to the pitch of tones of various frequencies
when we alter the intensity. For low tones, the pitch decreases
with intensity, but, for high tones, the pitch increases
with intensity. For certain tones in the middle range, both
effects are present to a slight degree. Thus at 2000 cycles, the
pitch, for this observer, increased up to about 60 db above
threshold and then decreased. At other frequencies this point
of reversal occurred at different intensities. In general, the
higher the frequency, the higher the intensity at which the
reversal takes place.

Fig. 23. Contours showing how pitch changes with intensity. The
percentage change in frequency necessary to keep the pitch of a tone constant in
the face of a given change in intensity can be taken as a measure of the effect
of intensity upon pitch. Pitch in this case is the parameter, as indicated by the
numbers attached to the curves. The ordinate scale was arbitrarily chosen so
that a contour with a positive slope shows that pitch increases with intensity.
(After Stevens, 5.) (Abscissa: intensity in db; zero db = 1 dyne/cm².)

Now, casual inspection of Fig. 23 shows that the frequencies
at which the change of pitch with intensity is least are those to
which the ear is most sensitive, as shown in Fig. 17 (p. 50).
Apparently the point of maximal sensitivity of the ear is the
point at which the reversal of the direction of the pitch-change
occurs. Then, the tendency of the reversal to take place at
higher frequencies when the intensity is raised, as shown by the
maxima of the middle group of curves in Fig. 23, should mean
that the frequency to which the ear is maximally sensitive is a
function of intensity. This inference is borne out, in a qualitative
manner, by a study of the equal-loudness contours shown
in Fig. 45 (p. 124). The minima of those curves represent the
points of maximal sensitivity at different loudness-levels. There
is clearly a slight shift of the minima in the direction of the
higher frequencies as the intensity is increased.

In this phenomenon we have a case of a physical system (the
ear) whose frequency-characteristic is a function of the
magnitude of the driving force. That is to say, the resonant frequency
of the total auditory mechanism, as represented by the minima
of the equal-loudness contours, is not the same frequency at all
intensities. This fact means that, as the intensity is raised, some
mechanism comes into action which has a selective effect upon
different frequencies. Apparently this mechanism attenuates
the response of the ear to low but not to high frequencies, as
the intensity is raised. Now, a mechanism which has precisely
this effect is the musculature of the middle ear (see p. 266).
Tension on the tensor tympani, which occurs reflexly in
response to loud sounds, serves to tighten the eardrum and to
impede the transmission of low tones. This selective attenuation
of low frequencies at high loudness-levels may well account
for the shift in the point of maximal sensitivity of the ear, and
presumably makes it reasonable to expect the shift in the
maxima of the pitch-contours of Fig. 23.

The foregoing discussion rests upon the assumption that the
forms of the loudness- and pitch-contours depend upon the
response-characteristics of the auditory mechanism when it is
viewed as a total system. We may thus inquire what is the
relation of the change of pitch with intensity to the behavior
of the mechanism in the inner ear, the basilar membrane. In
terms of the mechanics of the basilar membrane (Chapter 10),
the change of pitch with intensity is represented by a shift in
the position of maximal stimulation on the membrane. Since
high tones become higher and low tones become lower with
increased intensity, it is clear that the stimulation moves out
toward the ends of the membrane, for high and low tones.
What happens for tones of the middle range? Here the shift
is closely related to the point of maximal sensitivity of the ear,
and the shift is always such as to move the region of maximal
stimulation on the basilar membrane away from the point which
is stimulated by the frequency to which the ear is most sensitive.
In other words, as the intensity of tones is increased, their pitch
is shifted away from the pitch of the tone to which the ear is
maximally sensitive.

Here we have a phenomenon bearing a striking resemblance
to the Bezold-Brücke effect in vision, according to which
alteration of the intensity of a visual stimulus produces a change of
hue. The analogy holds even further, however, for just as there
are frequencies at which pitch is invariant with intensity, so
likewise does hue remain invariant with intensity at certain
wave-lengths of light.

A possible explanation of the change of pitch with intensity
is presented on p. 349.



CHANGE OF PITCH FOR VERY LOW TONES 

An investigation of the behavior of the pitch of tones below
300 cycles (Snow) shows that the lower the tone, the greater the
change of pitch for all tones above approximately 100 cycles.
Below 100 cycles the magnitude of the pitch-change decreases,
but the direction of the change remains the same. This
generalization must be modified for extremely loud tones, for at
high loudness-levels (above 105 db) the greatest change of
pitch appears to occur at about 200 cycles.

The comprehensive set of curves shown in Fig. 24 was
drawn to illustrate the behavior of the pitch of low tones. These
are probably the most representative curves that can be
constructed on the basis of present evidence. Our ability to
determine precise functions for the phenomenon of change of pitch
is restricted by the fact that large differences appear among the
results obtained with different observers. Nor is there great
agreement between the results obtained from the same observer
on different days. It is an interesting fact, however, that those
observers, in two different experiments, who showed the largest
changes of pitch with intensity were the most consistent in their
judgments (Stevens, 5; Snow).

Fig. 24. ... by which the pitch of a pure tone of any frequency is shifted as the tone is
raised from a level of 40 db to the level indicated on the contour. Example:
the pitch of a 100-cycle tone is lowered 10 per cent when its loudness-level is
increased from 40 db to 100 db, but the pitch of a 500-cycle tone is lowered
by only 2 per cent by the same change in loudness-level (Snow).
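
To give the percentages in the example accompanying Fig. 24 a musical meaning, the following sketch converts a percentage shift into cents, of which 100 make an equally tempered semitone. The cent is a conventional unit that the text does not itself use, and the function name is hypothetical; the sketch also assumes that each percentage is to be read as the equivalent change in frequency.

```python
import math


def shift_in_cents(percent_change):
    """Musical size, in cents, of a pitch shift expressed as a percentage
    change of the equivalent frequency (negative values mean lowering)."""
    ratio = 1.0 + percent_change / 100.0
    if ratio <= 0:
        raise ValueError("a lowering of 100 per cent or more is not meaningful")
    return 1200.0 * math.log2(ratio)


# Values from the example: lowering of 10 per cent and of 2 per cent.
shift_100_cycles = shift_in_cents(-10.0)
shift_500_cycles = shift_in_cents(-2.0)

print(round(shift_100_cycles), round(shift_500_cycles))
```

On this reckoning the shift quoted for the 100-cycle tone comes to a little less than an equally tempered whole tone, and that for the 500-cycle tone to roughly a third of a semitone.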

On the basis of the hypothesis previously laid down to
explain the relation of pitch to intensity, namely, the displacement
of the pattern of cochlear excitation toward the end of the
basilar membrane, it is reasonable to expect that the magnitude
of the change of pitch would decrease at very low frequencies.
When the excitation is already close to the end of the membrane,
further shift in that direction should be difficult. There is also
the possibility that at frequencies below 100 cycles (Steinberg)
the perception of pitch does not depend in a simple manner
upon a point of maximal stimulation, because the entire apical
end of the membrane may be involved for these low tones (see
p. 375). In this event anomalous behavior of the pitch of low
tones is to be expected.

The argument may well be made that we have, in this
phenomenon of change of pitch with intensity, a means of finding
the frequency below which spatial localization of excitation
on the basilar membrane is not the important determinant of
pitch. Spatial differentiation (the ‘place theory’ of pitch)
should not hold below the lowest frequency at which intensity
has an effect on pitch. Unfortunately, this point has not yet
been determined.

CHANGE OF PITCH FOR MUSICAL TONES 

It must be emphasized that the pitch changes shown in Figs.
23 and 24, which may seem very large, were obtained with pure
tones. These changes, of more than a full musical tone in many
cases, would undoubtedly have strange effects upon musical
renditions if pure tones were used. Fortunately, however, the
complex tones produced by most musical instruments suffer
only very slight changes of pitch with intensity. Thus when
four skilled musicians were asked to play a certain interval on
a violin, first very softly and then very loudly, the relation
between the objective frequencies constituting the interval was
not significantly different in the two cases. Since the players
were judging the intervals in terms of subjective pitch, this
result seems to indicate that the pitch was not changed by
intensity (Lewis and Cowan).

Additional evidence that the pitch of impure tones is
relatively stable was obtained by comparing the change of pitch in
a 5-partial tone (fundamental frequency of 200 cycles) with the
change in a pure tone of 200 cycles. The change in the pure
tone was about 5 times as great (Fletcher, 3). This difference
between pure and complex tones apparently depends upon the
fact that the complex tone contains, as partials, those frequencies
whose pitch, as shown in Fig. 23, changes very slightly with
intensity. Perhaps these partials control the magnitude of the
apparent change of pitch when the complex tone is varied in
intensity.

Finally, it is of great musical interest to determine whether
the harmonious relation of two tones depends upon the pitch or
upon the frequency of the tones. Will two tones constituting
an octave when sounded separately be harmonious when
sounded together? The answer, of course, depends upon the
intensive relations involved. For example, a soft tone of 300
cycles may appear to be an octave higher than a loud tone of
168 cycles, but when they are sounded together they are very
discordant. The reverse can also be demonstrated; that is, two
tones which appear to be slightly different in pitch may, when
sounded together, produce a harmonious result (Fletcher, 4).
We must conclude, therefore, that tonal combinations will be
harmonious or not, depending upon the frequencies rather than
upon the pitches of the components.

RELATION OF PITCH TO FREQUENCY 

From the foregoing discussion it is evident that the frequency
of a tone does not uniquely determine its pitch. Hence,
in specifying the pitch of a tone, it is desirable to refer to the
pitch of tones at some standard level of loudness. A convenient
standard is the 40-db loudness-level (Fletcher, 3). It is customary
to designate the pitch of a tone by the number of cycles
per second (the frequency) of that tone, at the loudness-level
of 40 db, which sounds equal in pitch to the given tone. This
procedure is a partial recognition of the fact that pitch is determined
by the responses of living organisms to sound stimuli,
but it has the defect of employing the numbers of the stimulus-scale
(frequency) to represent an aspect of sensation (pitch).
A desirable scale for the pitch of tones at the reference loudness-level
(40 db) would be one expressed in numbers whose values
are directly proportional to the magnitude of the perceived
pitch.

One might suppose that the musical scale, which divides the
range of audible frequencies into octaves, and these in turn into
tones and semitones, would serve as a pitch scale. The musical
scale, however, is inadequate, because two equal musical intervals
in different parts of the frequency range do not constitute
equal subjective intervals (see p. 83).

Our ability to construct a scale which will have the desired
characteristic of representing the magnitude of perceived pitch
by numbers proportional to that magnitude depends upon our
ability to devise operations for the measurement of aspects of
sensation. Controversy has for a long time centered upon the
proposition that it is possible to measure the attributes of sensation,
or to tell when one sensation is twice or three times as great
as another. The truth of this proposition must depend, not
upon a priori conceptions, but upon the outcome of experiment.
There are, however, certain considerations pertaining to all
sensory scales which we shall now digress to consider.

The Problem of Sensory Scales. We devise scales for the
purpose of facilitating the description of natural phenomena in
terms of functional relationships, expressed, if possible, by the
symbols of conventional mathematics. Scales, in other words,
are constructed for a purpose. The intended use of a scale, then,
becomes the most important factor in determining the criteria
which the scale must satisfy in order to be considered adequate.
In any particular case, we must first decide what sort of scale
we want, and then determine by experiment whether or not
such a scale can, in fact, be constructed. In other words, we
must decide upon the criteria of the scale, and then devise operations
for satisfying the criteria.





In the construction of sensory scales, the following considerations
are particularly important:

1. There are, in general, two types of scales. They may be
designated as intensive and numerical scales. Scales which
measure intensive magnitude enable us to place the things measured
in a rank-order, i.e., arrange them according to increasing
magnitude. Such a scale does not, however, enable us to say
how many times one magnitude is greater than another, but
only that it is greater. Numerical scales, on the other hand,
have numbers which express the numerical relations between
things measured. Thus when two magnitudes are measured
in terms of a truly numerical scale, the scale numbers can be
manipulated in accordance with arithmetical laws in order to
determine additional relations, such as the sum of two magnitudes
or the relative separation of two pairs of magnitudes.

2. These manipulations of the numbers on numerical scales
have significance only if the manipulations correspond to a set
of concrete operations which can be performed on the things
measured. Otherwise, the validity of the outcome of the manipulations
cannot be tested empirically. If, for example, we add
a magnitude of 2 scale units to another magnitude of 2 scale
units, and conclude that we have a magnitude of 4 units, our
statements are of no empirical significance unless we have some
concrete operations for adding the quantities that the scale
measures in order to verify the result. The operations will, in
general, be different for different things. Thus, the procedure
for adding 2 meters to 2 meters to make 4 meters is very unlike
that for showing that 2 henries of electrical inductance can be
added to 2 henries to give 4 henries. Similarly, in the case of
sensation we may reasonably expect to find that numerical scales
are based upon operations peculiar to it alone.

3. Now, in psychological measurements, most scales have
been scales of intensive magnitude from which rank-orders,
but not numerical relations, could be obtained. What we
should like to have are numerical scales: scales whose numbers
represent some aspect of the response of a living organism to a
class of stimuli, like sound.





4. The numbers on the numerical scale must be applied
to the attribute of sensation (which is, of course, an aspect of
an organism’s response) in such a way that when they are
manipulated according to the rules of arithmetic, one obtains
a result which can be verified observationally. To the manipulations
and to the results there must correspond a set of concrete
operations. Since the basic operation for determining a
sensory scale is that of presenting a stimulus to an observer and
noting his response, the results of manipulating the scale numbers
must, in general, be tested in terms of the ‘experience’
(response) of an observer. With such a scale the operation of
addition might consist in changing the stimulus until the observer
gives a particular response which indicates that a given
relation of magnitudes has been achieved. In other words, a
sensory scale is a scale of response, and the response of an
observer who says ‘this is half as great as that’ is one which,
for the purpose of erecting a subjective scale, can probably be
accepted at its face value.

5. Although we could, for different purposes, choose any
one of several sets of operations as defining the scale, that set
will generally prove most satisfactory which leads to scale numbers
bearing the most reasonable relation to the experience of
the observer. A reasonable scale is one for which the number N
stands for a sensation which does in fact appear to be half as
great as that represented by the number 2N, etc.
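
The distinction between intensive and numerical scales drawn in the considerations above can be illustrated by a small sketch. The Python below is offered only as an illustration: the four scale values and the cubing transformation are hypothetical and are not taken from any experiment. It suggests that a rank-order survives any order-preserving relabeling of the numbers, whereas the relation 'N is half of 2N', which a numerical scale must express, does not.

```python
# Hypothetical scale values assigned to four sensations (illustrative only).
magnitudes = [125.0, 250.0, 500.0, 1000.0]


def rank_order(values):
    """Return the indices of the values sorted by increasing magnitude."""
    return sorted(range(len(values)), key=lambda i: values[i])


def is_half(a, b, tol=1e-9):
    """True if scale value a is half of scale value b."""
    return abs(2.0 * a - b) <= tol * abs(b)


# An arbitrary order-preserving relabeling (here: cubing each number).
relabeled = [v ** 3 for v in magnitudes]

# The rank-order is unchanged, so either list serves as an intensive scale.
same_order = rank_order(magnitudes) == rank_order(relabeled)

# The 'half as great' statements hold only for the original numbers.
halves_before = all(is_half(a, b) for a, b in zip(magnitudes, magnitudes[1:]))
halves_after = all(is_half(a, b) for a, b in zip(relabeled, relabeled[1:]))

print(same_order, halves_before, halves_after)
```

Whether the numbers on a sensory scale deserve the stronger, numerical reading is therefore a question for experiment, not for arithmetic alone.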

A Numerical Scale of Pitch. On the assumption that observers
are able to tell when one tone is half as high in pitch as
another tone, a numerical scale of pitch has been constructed
from determinations of the half value of the pitch of tones of
several frequencies (Stevens, Volkmann, and Newman). The
observer was presented alternately with two tones at a loudness-level
of 60 db. One tone was fixed in frequency. The other
could be varied in frequency by the observer until its pitch
appeared to him to be half of that of the fixed tone. This
procedure is called the method of fractionation. Ten different
frequencies were used as the fixed tone. The five observers
who took part in the experiment showed consistency in their
judgments, even though some of them had previously made
the statement that pitch is not the sort of thing they would be
able to cut in half. The judgment is apparently easier than one
might suppose, especially if the observer does not become confused
by the recognition of musical intervals when he sets the
variable tone.

The geometric mean of the results of five observers is shown
in Fig. 25. This function tells us at what frequency the variable
tone must be set in order that it shall sound half as high in pitch
as the fixed tone. Hence, this function gives us the relationship
that we need to know in order to construct a scale of pitch whose
numbers are such that the number N stands for a pitch which
sounds half as high as that represented by the number 2N.
The pitch scale was constructed by assigning arbitrarily the
number 1000 to the pitch of a 1000-cycle tone, and the number
500 to the pitch of the tone which sounds half as high, as
determined from the curve in Fig. 25; similarly for the pitch
number 250, etc. By extending this procedure, we obtain the
pitch function shown in Fig. 26. This function expresses the
relation between pitch and frequency, at a constant loudness-level,
and therefore it constitutes a scale of pitch satisfying the
criteria laid down for it. This pitch-function has, within the
limitations of the particular experimental procedure by which
it was established, the numerical significance that the numbers
on the pitch-scale are related to each other as the subjective
magnitude of the pitches. The unit of the scale has been called
a mel (from the root of the word melody). A pitch of 500 mels
sounds twice as high as a pitch of 250 mels, provided “twice as
high” is understood as being defined by the operations employed
to establish the scale.

Fig. 25. The frequency (ordinate) of a tone which sounds half as
high in pitch as a standard tone of another frequency (abscissa).
(Stevens, Volkmann, and Newman.)

Fig. 26. The pitch function. This curve shows how the perceived pitch of
a tone (measured in mels) changes as a function of the frequency of the
stimulus. This pitch function was determined at a loudness-level of 60 db.
(Stevens, Volkmann, and Newman.)
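
The construction just described, in which the pitch number is halved each time the half-pitch curve is consulted, may be sketched as follows. The sketch makes a large assumption: since the measured curve of Fig. 25 is not available in numerical form here, the function `half_pitch_frequency` is a stand-in derived from a later textbook approximation to the mel scale (`assumed_pitch`), not from the observers' data. All function names are hypothetical.

```python
import math


def assumed_pitch(freq):
    """Assumed pitch in mels: a later approximation, NOT the curve of Fig. 26."""
    return 2595.0 * math.log10(1.0 + freq / 700.0)


def assumed_frequency(mels):
    """Inverse of assumed_pitch: frequency in cycles for a pitch in mels."""
    return 700.0 * (10.0 ** (mels / 2595.0) - 1.0)


def half_pitch_frequency(freq):
    """Stand-in for the Fig. 25 curve: the frequency an idealized observer
    would choose as sounding half as high as `freq`."""
    return assumed_frequency(assumed_pitch(freq) / 2.0)


def build_scale(anchor_freq=1000.0, anchor_mels=1000.0, steps=4):
    """Assign pitch numbers by repeated halving, as described in the text."""
    scale = [(anchor_freq, anchor_mels)]
    freq, mels = anchor_freq, anchor_mels
    for _ in range(steps):
        freq = half_pitch_frequency(freq)  # read off the half-pitch curve
        mels = mels / 2.0                  # halve the pitch number
        scale.append((freq, mels))
    return scale


scale = build_scale()
for freq, mels in scale:
    print(f"{mels:7.1f} mels  <-  {freq:7.1f} cycles")
```

With real fractionation data, `half_pitch_frequency` would be replaced by interpolation in the measured curve; the halving loop itself would be unchanged.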






RELATION OF THE PITCH SCALE TO 
ALTERNATIVE PROCEDURES 

The question arises concerning the possibility of verifying
this pitch function by other procedures, such as bisecting tonal
intervals, i.e., setting a third tone to a value halfway, in pitch,
between two other tones. Such verification is theoretically
possible; in fact, it is theoretically required, if the pitch scale
is to be more than trivial.

The method of bisection has been applied to tonal intervals,
but the results of different investigators have thus far been
contradictory (Pratt, 1). Some workers have insisted that the
bisection is made at the arithmetic mean, and some claim
evidence for bisection at the geometric mean. A famous controversy
was waged about this point (Titchener, 1). From the
form of the pitch function in Fig. 26, it is evident that the point
of bisection of an interval should depend upon the position of
the interval on the frequency scale. It would be desirable to
test the pitch function experimentally by the method of
bisection.

In testing one function by another, we must, however, proceed
cautiously to our conclusions. The ability of any two
methods of bisection or fractionation to confirm each other is
limited chiefly by the presence of constant errors in the experimental
procedures (compare the case of loudness, p. 120).
Some of the factors which may introduce constant errors are
the size of the interval and its position on the stimulus scale,
the order of presentation of the stimuli, the rate of presentation,
the initial value of the variable stimulus (compare Geiger and
Firestone), and the effect of what is known as “absolute judgment,”
namely, a tendency to adjust the variable tone to a value
which is to some extent independent of the values of the limiting
tones, [...]. The
final choice of a function to be used as a pitch scale will, therefore,
be subject to revision whenever the sources of constant
errors in the experimental procedures for fractionation can be
detected and eliminated.





THE USES OF THE PITCH SCALE 
Having established a tentative pitch scale, we can use it to
measure certain psychological magnitudes, and, by comparing
it with physiological data, we can obtain information as to the
probable basis of the judgment of pitch magnitudes. Some of
these relations are treated elsewhere (see p. 95 for measurement
of the size of difference limens and p. 97 for relation of pitch
scale to basilar mechanics).

The Measurement of Musical Intervals. An interesting application
of the pitch scale occurs in the measurement of the
subjective size of the musical intervals. We can measure the
perceived size of various octaves by determining from Fig. 26
how much the pitch changes from one octave to the next. In
a similar way we can measure the size of other musical intervals.
In general, the subjective size of any musical interval is
proportional to the slope of the pitch function at the frequency
which falls at the middle of the interval.

Fig. 27. How the subjective size of musical intervals (in mels) changes
as a function of the frequency of the geometric mean of the
interval. The subjective size of different octaves may vary by as much as
twentyfold. (Stevens, Volkmann, and Newman.)

In order to illustrate these relationships, the subjective sizes
of successive octaves and fifths, as measured in mels, are plotted
in Fig. 27. The plot for other intervals would be similar in
form but different in ordinate value.
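
The measurement of intervals on the pitch scale amounts to taking differences of pitch numbers, and may be sketched as below. As before, `assumed_pitch` is a later approximation to the mel scale used purely as a stand-in for the measured curve of Fig. 26, so the printed values are illustrative and should not be read as the values plotted in Fig. 27. The starting frequency of 64 cycles is likewise an arbitrary choice.

```python
import math


def assumed_pitch(freq):
    """Assumed pitch in mels: a later approximation, NOT the curve of Fig. 26."""
    return 2595.0 * math.log10(1.0 + freq / 700.0)


def interval_size(low_freq, ratio):
    """Subjective size, in mels, of the interval from low_freq to low_freq * ratio."""
    return assumed_pitch(low_freq * ratio) - assumed_pitch(low_freq)


# Successive octaves (ratio 2) and fifths (ratio 3/2), starting at 64 cycles.
interval_starts = [64.0 * 2 ** k for k in range(6)]  # 64, 128, ..., 2048 cycles
octaves = [interval_size(f, 2.0) for f in interval_starts]
fifths = [interval_size(f, 1.5) for f in interval_starts]

for f, octave, fifth in zip(interval_starts, octaves, fifths):
    print(f"from {f:6.0f} cycles: octave {octave:6.1f} mels, fifth {fifth:6.1f} mels")
```

Under the assumed function, equal frequency ratios yield unequal numbers of mels, larger in the upper octaves than in the lower; the sketch makes no claim about the form of the measured function at the extremes of the frequency range.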






Quite definitely, musical intervals become subjectively larger
as frequency increases up to the fourth octave above middle C
(up to 4096 cycles). In other words, throughout the useful
musical range, intervals increase in perceived magnitude with
increasing frequency of the stimulus. This conception is not
entirely novel. Stumpf decided that, in spite of the great difficulty
of making these subjective comparisons directly, the
upper octaves are perceptually larger than the lower octaves.
Thus the pitch scale enables us to confirm Stumpf’s judgment.
Incidentally, these facts contradict the prevalent notion that
“equal ratios of frequency give rise to equal intervals of pitch”
(A. H. Davis, p. 235).

DIFFERENTIAL SENSITIVITY TO FREQUENCY

It is important for many purposes to determine accurately
the minimal change in the frequency of a tone which can be
detected by the ear. The size of the just noticeable difference
in frequency determines the differential sensitivity, or the resolving
power, of the ear. This minimal change in frequency
is known as the difference-limen (DL). Strictly, the concept
of differential sensitivity should be defined as the reciprocal of
the DL. The size of a DL is a function of the frequency and
the intensity of the tone, and is different for different observers.
As in the case of the threshold of hearing, the concept of the DL
is essentially statistical. Since the differential sensitivity of a
living organism is in a continuous state of fluctuation, the ideal
value for a DL would be that difference which is detectable by
the organism 50 per cent of the time. This ideal is approximated
in most psychophysical work (cf. Guilford for discussion
of psychophysical methods).
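
The statistical definition of the DL can be made concrete with a short sketch. The detection proportions below are hypothetical, invented only to show the computation, and linear interpolation is merely one simple way of locating the 50 per cent point; it is not put forward as the procedure of any of the experimenters cited.

```python
# Hypothetical observations at 1000 cycles: (frequency change in cycles,
# proportion of trials on which the change was detected).
observations = [(1.0, 0.10), (2.0, 0.35), (3.0, 0.60), (4.0, 0.85)]


def difference_limen(points, criterion=0.5):
    """Interpolate linearly to the change detected on `criterion` of trials."""
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 <= criterion <= p1:
            return x0 + (criterion - p0) * (x1 - x0) / (p1 - p0)
    raise ValueError("criterion lies outside the observed proportions")


frequency = 1000.0
dl = difference_limen(observations)  # absolute DL, in cycles
relative_dl = dl / frequency         # the ratio of the DL to the frequency
sensitivity = 1.0 / dl               # differential sensitivity = 1 / DL

print(dl, relative_dl, sensitivity)
```

The three printed quantities correspond to the absolute DL, the relative DL, and the differential sensitivity as these terms are used in the text.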

The pioneer work on discrimination of frequency is associated
with Preyer (1876), Luft, and Meyer. It
was characterized chiefly by exceptionally small values for the
DL (see Vance for historical summary). The next group of
experimenters, Vance, Schaefer, and Stucker, agreed well with
each other in determining DL’s, but found them to be significantly
larger than those of the pioneer workers (Vance). Two
representative sets of data are plotted in Fig. 28, one for Luft and
one for Vance. This plot is in terms of the relative DL, the
ratio of the DL to the frequency at which it was determined.
The early work can probably be criticized on the ground that
extraneous cues for the identification of the lower or higher
tone were not eliminated. It is particularly difficult to eliminate
these cues when tuning forks are used as the source of sound.



Fig. 28. The various values of the relative difference-limen (ΔF/F)
obtained by various experimenters under different experimental conditions.

Also, Luft used a different method (method of minimal change)
and hence a different criterion for the DL from that employed
by Vance, who used the method of constant stimuli. The
selection of observers may also influence the size of the observed
DL. Thus, if we average the results of the ‘best’ ten out of
Vance’s fifty observers, we obtain results which in Fig. 28 can
be seen to lie well below the average for the fifty observers.

The first systematic determination of differential sensitivity
to frequency by means of electrically generated tones was conducted
by Knudsen (1), who obtained results in good agreement
with those of Vance.

The shortcoming of all this early work is the fact that no
single experimenter was able to measure DL’s at all audible
frequencies and at all levels of intensity. It remained for
Shower and Biddulph to make a more thorough investigation
of differential sensitivity. They covered the frequency-range
from 31 to 11,700 cycles at sensation-levels ranging from 5 db
above threshold to the maximal level which the observer could
tolerate at any given frequency. They used an essentially novel
method, in order to minimize the effects of harmonics and of
the transient frequencies which arise whenever a tone is turned
on or off abruptly. A rotary condenser in the tuning circuit
of an oscillator was so arranged that the observer could listen to
a tone of unvarying pitch for a short interval of time. Then
the frequency was changed sinusoidally to a new value, to which
the observer listened for another short interval of time, whereupon
the frequency returned, sinusoidally, to its original value.
In other words, there was a smooth transition from one frequency
to the other. The difference between the two frequencies
was controlled by the separation between the plates
of the rotary condenser, and the observer reported when this
difference was just large enough for the variation in pitch to be
detected.

Fig. 29. The dependence of the relative difference-limen (ΔF/F) upon the
rate at which the frequency of a tone is varied from the higher to the lower
frequency. The difference between the higher and the lower frequency
defines the value ΔF. (Shower and Biddulph.)

Under this method the differential sensitivity of the ear
becomes a function of the rate at which the frequency is varied.
By controlling the speed of the rotary condenser, this function
was determined, as shown in Fig. 29. The best rate of frequency
variation was taken to be 2 per sec.

The results plotted in Fig. 28 are for binaural listening under
the condition of Shower and Biddulph’s experiment. Similar
results were obtained for bone-conduction, a special case of
binaural listening. For monaural listening, however, the relative
DL’s are larger than for binaural. In Figs. 30 and 31, the
results for monaural listening (averages from 10 ears) are
plotted to show the effects of frequency and intensity on the
relative DL. (See Table I for tabulated data.) At low frequencies
and at low intensities, the DL’s are largest. This
effect of intensity upon the size of DL’s may well be due to the
distribution and innervation of the hair-cells on the basilar
membrane (see p. 275 and p. 369).

Fig. 30. The dependence of the relative difference-limen (ΔF/F) upon
frequency. The parameter here is sensation-level, as indicated by the numbers
attached to the curves. (Shower and Biddulph.)

The marked rise in the curves of Fig. 30 at low frequencies
is greater than that found by any previous experimenters, but
it is precisely at these low frequencies that the effects of harmonics
and transients would be most effective in artificially
lowering the DL. By repeating the experiment, using abrupt
variation from one frequency to the other, much smaller DL’s
[...].

Fig. 31. The dependence of the relative difference-limen (ΔF/F) upon
sensation-level. Here the parameter is frequency, as shown by the numbers
attached to the curves. (Shower and Biddulph.)

At frequencies above 500 cycles the relative DL’s (ΔF/F) are
approximately constant. Below 500 cycles the absolute DL’s
(ΔF) are approximately constant, except for very low frequencies [...]



[Table: sensation level by frequency (31, 62, 125, 250, 500, 1000 cycles); tabulated values not recoverable]






[...] number of transients created, but we have disregarded the fact that,
no matter what the mode of transition, energy is scattered into
other frequency regions. Even when the frequency is allowed
to vary sinusoidally, the result is a sound with a complex spectrum,
rather than a simple one. Change the frequency from
495 to 505 cycles and back again twice per second and we have
precisely the equivalent of a set of steady tones spaced 2 cycles
apart on each side of 500 cycles (see Chapter 9). These steady
tones, or ‘side bands,’ enter the ear simultaneously and set up
on the basilar membrane a pattern of stimulation. If the fibers
of this membrane were very much more sharply tuned than
they are, the ear would be able to respond separately to each of
these individual tones, but, since the tuning is not sharp, the
disturbances overlap and the tones beat with one another.
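
The equivalence between a frequency-modulated tone and a set of steady side bands can be checked numerically. The sketch below assumes a purely sinusoidal variation of frequency, for which the standard result is that the component lying n steps from the center has relative amplitude |J_n(index)|, where J_n is the Bessel function of order n and the index is half the frequency range divided by the rate of alternation. The function names are hypothetical, and the Bessel function is computed from its integral form so that only the standard library is needed.

```python
import math


def bessel_j(order, x, steps=2000):
    """Bessel function J_n(x) for integer n, from its integral form,
    evaluated by the midpoint rule."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) * h
        total += math.cos(order * tau - x * math.sin(tau))
    return total * h / math.pi


def side_bands(low, high, rate, max_order=6):
    """Steady components of a tone swept sinusoidally between `low` and
    `high` (cycles) `rate` times per second, as (frequency, amplitude) pairs."""
    center = (low + high) / 2.0
    index = ((high - low) / 2.0) / rate  # modulation index
    return [(center + n * rate, abs(bessel_j(n, index)))
            for n in range(-max_order, max_order + 1)]


for freq, amp in side_bands(495.0, 505.0, 2.0):
    print(f"{freq:6.1f} cycles  relative amplitude {amp:.3f}")
```

The components fall 2 cycles apart on each side of 500 cycles, as stated above. Calling `side_bands(1000.0, 1003.0, 2.0)` gives the corresponding picture for a tone alternating between 1000 and 1003 cycles, again only under the assumption of a strictly sinusoidal transition.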

Now, how are we to understand the situation in the experiment
of Shower and Biddulph? They found that, for 1000
cycles at 80 db above threshold, the just noticeable increment
in frequency is exactly 3 cycles. In other words, when the two
frequencies, 1000 and 1003 cycles, were presented alternately,
with a sinusoidal transition from one to the other, at the rate
of two alternations per second, the observer could just detect
the effect. Of what, we may ask, did the stimulus actually
consist from the point of view of its Fourier analysis? Plot A
of Fig. 32 provides the answer. It shows the approximate amplitude
of the several components of the stimulus. These components
are spaced 2 cycles apart and are symmetrical about the
central component, whose frequency, in this instance, is 1001.5
cycles. These are actually the steady tones which we send into
the ear when we modulate the frequency of a sound wave, after
the manner of Shower and Biddulph. Nevertheless, the ear
hears such a stimulus as an alternation between two pitches,
an amazing effect which must be attributed to the fact that the
components are close enough together on the basilar membrane
to beat with one another. Each of the three major components
sets into vibration a region of the basilar membrane, and we
might represent the maximum amplitude of these forced vibrations
by the solid curves of plot B in Fig. 32. However, the
relative phases of these vibrations are constantly changing, because
the frequencies are different. Consequently, there will
be a time when the central and left-hand components will
reinforce each other, but will be out of phase with the
component on the right. Then the effects will summate to
produce a disturbance on the basilar membrane whose amplitude
is represented schematically by the left-hand dotted curve. At a
later time the phases will combine in such a way as to give a
result like the dotted curve on the right. At intermediate times
the maximum of the disturbance will lie between the peaks of the
two dotted curves, and it is the shifting back and forth of this
maximum which provides us with the impression that the pitch of
the tone is alternating between two just discriminable values.
(These curves are only schematic; in order better to illustrate
the principle, they are drawn much sharper than the actual peaks
of the disturbance in the ear.)

Fig. 32. Showing how the spectrum of a frequency-modulated tone affects
the basilar membrane. Plot A shows the spectrum of a 1000-cycle tone which
is modulated by a just perceptible amount; plot B demonstrates how the
patterns of disturbance on the membrane due to the several components
[...] between the two positions occupied by the dotted curves.

Now, if we knew the exact form of vibration of the basilar
membrane at all frequencies and intensities, we could determine
the precise distance between the peaks of the dotted curves in
Fig. 32, and this distance could be taken as the resolving power
of the ear. In the absence of this knowledge, we probably do
well to take the measured DL’s as the best indication of resolving
power. Nevertheless, while we are considering the experiment
on differential sensitivity in terms of the spectra of the
stimuli which produce a just noticeable difference, it is of interest
to examine some of these spectra. The spectra in Fig. 33
were constructed, from the data presented by Shower and
Biddulph, by considering the transition from the higher to the
lower tone as intermediate between the rectangular and sinusoidal
forms (see van der Pol, 2, for analysis of these two types
of modulation). These are, to a fair approximation, the spectra
which, upon entering the ear, give the listener an impression
of a tone which alternately changes in pitch by a just perceptible
amount. Because the rate of alternation from the high to the
low tone was 2 per second, the components of the spectra are
spaced 2 cycles apart, and their relative amplitudes are proportional
to the heights of the lines.

Fig. 33. The spectra of tones at various sensation-levels whose frequency
is modulated by the amount needed to produce a just perceptible change in
pitch. The frequency of the central component is indicated at the left. The
rate of modulation was 2 per second and the components are spaced apart by
2 cycles.

At the low intensity of 5 db above threshold, where the ear
is relatively insensitive to small differences of frequency, the
spectra are composed of more numerous bands, and the relative
sizes of the outer bands are greater than at higher intensities of
the same frequency. When the components are numerous the
same principle applies as when there are only three: the successive
cancellation and reinforcement by the different components
cause the point of maximal disturbance on the basilar
membrane to move back and forth. For a given frequency of
the central component, the excursion of this disturbance is
greater when the side bands are distributed more widely. As
the frequency increases, the spectra, as we should expect, grow
wider, but there is very little difference in the spectra producing
a DL at frequencies below 500 cycles. Even at frequencies
below 100 cycles, where the spread of the disturbance on the
basilar membrane is presumed to be quite extensive, the indications
are that a spectrum of sound probably produces a DL
through the operation of the same principles of reinforcement
and cancellation as outlined for the case of a higher frequency.

Referring once more to the curve in Fig. 29, we note that,
with a 1000-cycle tone, a change of 3 cycles is just detectable,
both when the alternation is made once and when it is made 3
times per second. Since the spectrum of a modulated sound
depends upon both the rate and the range of the modulation,
the spectra will appear quite different in these two cases. When
the alternation is 3 times per second, the components are spaced
3 cycles apart, and their relative amplitudes are as pictured in
plot A of Fig. 34. When the rate is once per second, the
separation between bands is only 1 cycle, and the relative
amplitudes are as shown in plot B. These two rather
different-looking arrays of steady tones produce identical DL’s,
provided we define the DL as the total extent of the modulation,
and presumably they do so in accordance with the same principle
of reinforcement and cancellation. Owing to
the beating of the various components, the maximal disturbance
on the basilar membrane shifts back and forth through an equal
extent for both spectra.

Fig. 34. The spectra of a tone (1000 cycles) whose frequency is
modulated by a just perceptible amount, at two different rates:
3 per second (plot A) and 1 per second (plot B).





A curious coincidence is the fact that the lowest value
obtained for the relative DL for frequency is the same in both
vision and audition. In both modalities the smallest ratio ΔF/F
is very nearly 0.002. This suggests the question, “Does the
alternate presentation of two different visual stimuli scatter
energy to other frequencies?” Presumably it does. If we
were to raise and lower, at a rate of 3 per second, the frequency
of a single wave length of light (provided such could be
obtained), the resulting stimulus would presumably have components
spaced 3 cycles apart, but, although the ear can resolve
tones separated by 3 cycles, we cannot hope that the eye could
do so, because the DL for frequency in vision is of the order of
10^12 cycles. The enormously high frequency of light would
render trivial the effects of modulation. Therefore, the coincidence
of values of the relative DL for frequency in both vision
and audition does not depend upon the detection of the effects
of modulation in both cases.
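
The order of magnitude involved in the visual case can be checked by simple arithmetic. The frequency of light used below, 5 x 10^14 cycles per second, is an assumed round figure for the middle of the visible spectrum and is not given in the text; the relative DL of 0.002 and the modulation rate of 3 per second are the values quoted above.

```python
# Assumed round figure for the frequency of visible light (cycles per
# second); this value is an assumption, not taken from the text.
light_frequency = 5.0e14

# Smallest relative DL quoted in the text for both vision and audition.
relative_dl = 0.002

# Absolute DL for frequency in vision, in cycles.
visual_dl = relative_dl * light_frequency

# Spacing of the components produced by modulating 3 times per second.
band_spacing = 3.0

# How many times larger the visual DL is than the spacing of the components.
ratio = visual_dl / band_spacing

print(f"visual DL of the order of {visual_dl:.0e} cycles")
print(f"that is {ratio:.1e} times the spacing of the components")
```

On these assumptions the side bands lie closer together than the visual DL by many orders of magnitude, which is the sense in which the effects of modulation are rendered trivial.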

THE INTEGRATION OF DIFFERENCE LIMENS 

The integration of the DL’s for frequency serves many purposes.
We may wish to know the total number of discriminable
pitches in the range of audible frequencies, we may desire to
measure the subjective magnitude of a DL by means of the
pitch scale, or we may be interested in comparing the integrated
DL’s with certain physiological functions (see Chapter 15).
The purpose of the integration will determine between what
limiting frequencies and along what path we shall make the
integration. The summation can, of course, best be made by
graphical methods. When the ratio F/ΔF, the reciprocal of the
relative DL, is plotted against the logarithm of the frequency,
the area between any two limits under the curve is proportional
to the number of DL’s between those two limits.
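
The graphical summation described above can be imitated numerically: the number of DL's between two frequencies is the area under F/ΔF taken with respect to the natural logarithm of F. The sketch below is a minimal illustration under stated assumptions. The function `toy_dl` is a hypothetical, simplified DL function (constant in cycles below 500 cycles, a constant fraction of the frequency above), with values chosen only for illustration and not taken from the measurements of Shower and Biddulph.

```python
import math


def toy_dl(freq):
    """Illustrative DL in cycles: constant below 500 cycles, a constant
    fraction of the frequency above. The numbers are assumptions."""
    return 1.5 if freq < 500.0 else 0.003 * freq


def count_dls(low, high, dl=toy_dl, steps=20000):
    """Area under F / delta-F plotted against ln F, by the midpoint rule."""
    log_low, log_high = math.log(low), math.log(high)
    h = (log_high - log_low) / steps
    total = 0.0
    for k in range(steps):
        freq = math.exp(log_low + (k + 0.5) * h)
        total += freq / dl(freq)
    return total * h


print(round(count_dls(20.0, 12000.0)))
```

With measured DL's in place of `toy_dl`, the same summation could be carried along any chosen loudness-contour; the count obtained from the toy values should not be read as a measurement.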

The Number of Discriminable Pitches. On integrating
along the reference level for pitch-comparisons (40-db loudness-level),
between the limits of 20 and 12,000 cycles, we find that
the ear can discriminate about 1400 different pitches. From
present data we cannot extend the integration to 20,000 cycles,
but a reasonable extrapolation of the curves in Fig. 30 makes
it appear that between 20 and 20,000 cycles there are nearly 1500
discriminable pitches. Along the loudness-contour at the 60-db
level the number of discriminable differences between 20 and
20,000 cycles is about 1800. At the 60-db level the number of
distinguishable pitches is maximal and is approximately 3 times
as large as the number at the very low level of 5 db.

The Subjective Size of Difference Limens for Frequency.
The experimental procedure for determining the objective extent
of a DL (in cycles) is such as to yield a measure of the
resolving power of the ear, but it obviously does not provide
for the measurement of the subjective size of the DL’s. Discovery
of the increment in frequency which is just noticeable,
first at 200 cycles and then at 2000 cycles, does not disclose
whether or not, at these different frequencies, just noticeable
increments are subjectively equal in magnitude. Fechner assumed,
quite arbitrarily, that, when two increments are both
just noticeable, they are subjectively equal. Now, it is clear
that two DL’s are equal in the one respect of being both just
noticeable (Boring, 2). But does this mean that they are also
equal in respect of subjective magnitude? Since the operations
for determining DL’s are different from those for determining
equal subjective magnitudes, we are not justified in assuming
that DL’s possess equal subjective magnitude until they have
been compared in terms of a subjective scale.

If all DL’s at a given loudness-level are of equal subjective
magnitude, their integration should yield a function identical
with the pitch function of Fig. 26. This is equivalent to saying
that, if all DL’s are equal, the pitch arrived at by summating
100 DL’s should appear half as high as the pitch obtained by
summating 200 DL’s. If we take the pitch function of Fig. 26 as
defining a numerical scale of perceived pitch, we can compare it
to the integrated DL’s, in the manner shown in Fig. 35. The close
correspondence between the pitch function (solid curve) and the
function (solid squares) obtained from integrating DL’s shows
that, within the limitations of present measurements, all DL’s
for frequency are of equal subjective magnitude.



This relation is true, of course, only provided the DL’s and the
pitch-function are determined at a constant loudness-level. At
other loudness-levels the pitch-function would need to be
corrected slightly in terms of the shift of pitch with change of
intensity (Fig. 23). We could then compare the corrected
pitch-function with an integration of DL’s at the loudness-level
of, let us say, 5 db. Such a comparison shows that, within the
accuracy of the available data, all DL’s at the 5-db level are
also of equal subjective size.

Fig. 35. The relation of the pitch function (solid curve) to the
integrated DL’s for frequency (solid squares) and to the
experimental location of positions of vibration on the basilar
membrane (circles). The ordinate scale (on left) shows the number
of DL’s as a function of frequency, when the integration is made
at a loudness-level of 60 db. The pitch scale in mels can be
obtained by multiplying these ordinate values by the factor 2.53.
The relative locations of positions of vibration on the basilar
membrane are obtained by laying the linear extent of the membrane
along the ordinate. Thus the ordinate scale on the right
represents the relative linear extent of the membrane, in both
man and guinea pig. (Stevens, Volkmann, and Newman.)

How, then, do DL’s at different loudness-levels compare? Since
there are about 3 times as many discriminable pitches at the
60-db level as at the 5-db level, the DL’s at the 5-db level are
about 3 times as large, subjectively, as those at the 60-db
level. This relation is shown by the fact that, on integrating
the first 100 DL’s at 60 db, we arrive at a pitch of about 250
mels, whereas the integration of 100 DL’s at 5 db brings us to a
pitch of about 800 mels. In general, since the correction to be
made in the pitch function when loudness-level is changed is
slight, and since the integrations of DL’s at various levels give
essentially similar functions (differing only by a constant of
proportionality), we may conclude that, as a first approximation,
the relative subjective size (in mels) of DL’s at different
loudness-levels is given by their relative objective size (in
cycles). Hence, from Fig. 30 we see that, as the intensity is
increased, the subjective size of DL’s must decrease. Above a
sensation level of 60 db, however, the size remains essentially
constant. These relations between the subjective size of DL’s for
frequency are very different from those for intensity (see
p. 148).
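
The comparison of the two loudness-levels comes down to simple arithmetic, which may be set out as a short sketch. The figures are the approximate ones quoted in the paragraph above; the function name is introduced here only for illustration.

```python
def mels_per_dl(pitch_reached_mels, dls_integrated):
    """Average subjective size of one DL, in mels."""
    return pitch_reached_mels / dls_integrated


size_at_60_db = mels_per_dl(250.0, 100)  # first 100 DL's at 60 db reach about 250 mels
size_at_5_db = mels_per_dl(800.0, 100)   # first 100 DL's at 5 db reach about 800 mels

print(size_at_60_db, size_at_5_db, size_at_5_db / size_at_60_db)
```

The ratio comes out a little above 3, in keeping with the statement that the DL’s at the 5-db level are about 3 times as large, subjectively, as those at the 60-db level.
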

Relation to Basilar Mechanics. On the assumption that to each
audible frequency there corresponds a position on the basilar
membrane ‘tuned’ to that frequency, it is interesting to note
certain relations. In Fig. 35 are plotted circles representing,
approximately, the positions of excitation along the basilar
membrane (ordinate) as a function of frequency (Stevens, Davis,
and Lurie). The similarity between the three functions, (a)
position on the basilar membrane, (b) the pitch function, and (c)
the integrated DL’s, supports the original assumption of Wegel
and Lane to the effect that any two tones are just discriminable
in pitch when they stimulate two areas of the basilar membrane
separated by a certain constant distance.

The correspondence of these functions suggests another
interesting hypothesis. Apparently, when an observer is asked to
set a second tone to half the pitch of a given tone, he changes
its frequency until it stimulates a position on the basilar
membrane midway between the position stimulated by the given tone
and the apical end of the membrane (Stevens, Volkmann, and
Newman). He is, of course, not aware of these locations as such,
but the underlying physiological process which makes comparison
of pitches possible seems to be characterized chiefly by spatial
differentiation. Although there are subsequent central nervous
processes, the form of certain discriminatory functions is
evidently imposed by the receptor mechanism.

THE PITCH OF COMPLEX SOUNDS

Although we define the concept of pitch in terms of the
perception of pure tones, it is clear that noises and other
aperiodic sounds may have a more or less definite pitch. In
general, the pitch of a complex sound depends upon the frequency
of its dominant components. Thus we find that when observers are
asked to designate the pitch of a tonal mass composed of numerous
frequencies, all lying within a restricted band, the observers
name a pitch which is close to the center of the band (Ekdahl and
Boring). Of course, the pitch of a noise, or of a tonal mass, is
more or less indeterminate, depending upon the range of
frequencies present.

The most extremely complex sound is that commonly called thermal
noise. This is the noise heard when the voltage due to the random
motion of electrons in an electrical resistance is amplified and
impressed on a loud speaker. Provided the amplifier and speaker
have uniform characteristics, the resulting noise contains all
audible frequencies at equal intensity. It sounds much like a
hiss or a shhh. Of course, the ear does not distinguish the
presence of any of the individual frequencies in the continuous
spectrum of this noise, but when any band of frequencies is
eliminated by means of electrical filters, a striking change
occurs in the sound. This fact shows that those frequencies which
were filtered out had their effect in the total noise. As might
be expected, the pitch of thermal noise is quite indeterminate.
However, the observer is not aware of any extremely low or
extremely high characteristics. Frequencies of the middle range
(1000 to 5000 cycles) appear to dominate the sound. Since the ear
is most sensitive throughout the middle range, these components
are the ones, in a thermal noise, which are most effective in
producing stimulation, and hence are most effective in
determining pitch.

When the components of a sound are few enough in number, the ear
can resolve the complex into its individual frequencies. Thus a
trained observer can discriminate the upper partials of a
vibrating piano string, or pick out the individual instruments in
a symphonic chord. The fact that the ear resolves a complex tone
into its components is known as Ohm’s acoustic law. Presumably
this capacity depends upon an ability to discriminate the
separate areas of excitation on the basilar membrane. When the
areas are too numerous, however, or overlap too much, resolution
fails.

THE CASE OF THE MISSING FUNDAMENTAL 

Whenever a complex tone is composed of frequencies differing by a
constant amount of 100 cycles or more, the apparent pitch of the
complex mass is not the mean of the component frequencies, but is
that of a tone whose frequency is equal to the constant
difference (Fletcher, 3). Thus, when the component frequencies
700, 800, 900, and 1000 cycles are sounded together, the pitch is
judged to be that of a 100-cycle tone. When the components 400,
600, and 1000 cycles are sounded, the pitch appears to be that of
a 200-cycle tone. It is possible, then, to demonstrate a
paradoxical phenomenon. When to the combination of 400, 600, 800,
and 1000 cycles the tones 500, 700, and 900 cycles are added, the
pitch appears to drop by precisely an octave. In other words,
under these special conditions, the addition of pitches, each of
which is higher than the apparent pitch of the complex, may
result in a lowering of the perceived pitch of the ensemble.
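
The examples above can be checked with a small sketch. It rests on an assumption not made in the text: the greatest common divisor of the component frequencies is used as a stand-in for the "constant difference." The two agree for the combinations quoted here, but they need not agree in general (components of 750, 850, and 950 cycles differ by 100 but have a common divisor of 50). The function name is hypothetical.

```python
from functools import reduce
from math import gcd


def apparent_pitch_cycles(components_cycles):
    """Greatest common divisor of the components (whole numbers of cycles).

    Used here only as a convenient stand-in for the constant difference
    between components; it is an illustrative assumption, not a law of hearing.
    """
    return reduce(gcd, components_cycles)


print(apparent_pitch_cycles([700, 800, 900, 1000]))
print(apparent_pitch_cycles([400, 600, 1000]))

before = apparent_pitch_cycles([400, 600, 800, 1000])
after = apparent_pitch_cycles([400, 500, 600, 700, 800, 900, 1000])
print(before, after)
```

Adding the higher tones 500, 700, and 900 halves the common spacing, which corresponds to the drop of an octave described in the text.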

These facts make comprehensible the striking experiment in which
the fundamental frequency of a musical note is removed by
selective filters without changing the apparent pitch of the
note. Since the harmonics present in the note differ by a
constant amount, namely, an amount equal to the frequency of the
fundamental, the harmonics alone are sufficient to determine the
pitch of the note. Even when all the frequencies below a certain
value (300 cycles, for example) are removed from a musical
selection, the quality of the music is altered to an
astonishingly small degree. The Bell Telephone Laboratories have
developed phonograph records illustrating these phenomena.





RELATION OF PITCH TO DURATION 

The foregoing laws of pitch perception relate to sounds presented
for a second or more. What happens, then, to pitch when the
duration of sounds is reduced to smaller and smaller values?
Clearly, in the limiting case, when only one cycle of a tone is
presented, one hears a click rather than a tone. It might be
urged that a click has no pitch, but, on the other hand, casual
observation shows that some clicks sound higher than others. A
systematic problem would be to determine whether a single wave
taken from a 1000-cycle tone sounds higher in pitch than a single
wave taken from a 500-cycle tone. The wave from the higher tone
would represent a sharper pressure gradient, or a steeper wave
front, and might well give rise to the perception of a higher
pitch when the two waves are directly compared (see p. 282).

A related problem is the determination of the number of cycles
required for a tone to be perceived as having a definite tonal
quality. Historically this problem has been phrased as though a
single answer were possible, as though a tone lost its pitch
quite suddenly when the duration was reduced to a certain small
value. Actually, however, as the duration of a tone is decreased,
several changes take place. More specifically, if we begin by
presenting a 1000-cycle tone for a very brief period, say 2 or 3
msec, and then increase the tone’s duration, the sensation passes
through three principal stages. First, one hears a nearly
toneless click which seems to be without pitch. Second, the sound
acquires a more or less definite pitch, although the click
remains one of its prominent aspects, but this apparent pitch is
different from the pitch of the same frequency sounding for a
longer time. Finally, a duration is reached at which the pitch
can be ascertained without the constant error typical of the
second stage.

As we have already seen, when the tone is in the first stage and
is ‘pitchless,’ closer observation may reveal that it has some
degree of pitch; that is to say, a listener may be quite certain
that the tone is higher or lower than some other tonal standard.
Whenever such comparisons can be made, we have a method of
assigning a pitch to a sound, even though the range of
uncertainty may be large.

In the second stage, where the duration is of the order of 10
msec, the pitch, although growing more definite, is clearly lower
than for longer durations. One claim has been made that the pitch
of high tones is lowered by shortening their duration, but that
the pitch of low tones is raised (Burck, Kotowski, and Lichte,
2). An experiment now in progress seems to show, however, that
the apparent pitch of all tones (at least from 250 to 8000
cycles) falls with shortened duration (Ekdahl and Stevens).
Fig. 36 shows the course of pitch as a function of the duration
of the stimulus.

Fig. 36. Showing how the pitch of a tone changes as a function of
its duration. The ordinate shows at what frequency a tone lasting
1.5 sec sounds equal in pitch to a 1000-cycle tone presented for
the period of time indicated by the abscissa. Example: a tone of
1000 cycles lasting 0.01 sec sounds equal in pitch to a tone of
842 cycles lasting 1.5 sec. Each point represents the average of
60 observations by a single observer. (Ekdahl and Stevens.)

The line between the second and the third stage is not sharp, and
it varies from person to person. Burck, Kotowski, and Lichte
tested several people and found as typical the results plotted in
Fig. 37. The absolute time necessary for the identification of
the pitch of a tone is smallest in the middle range of
frequencies, where it is approximately 0.01 sec. The number of
sound waves in these tones can be found, of course, by
multiplying the frequency by the duration, and we find that from
3 to 4 waves are required to specify the pitch of tones below 200
cycles. At 1000 cycles about 12 waves are needed, and at 10,000
cycles the number jumps to about 250.

Fig. 37. Showing how long a tone of a given frequency must last
in order to produce the experience of a definite pitch, according
to the criterion used by Burck, Kotowski, and Lichte (2).
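
The multiplication of frequency by duration can be set down as a brief sketch. The wave counts are the approximate ones quoted in the text; the function names are introduced here for illustration, and the choice of 100 cycles to represent "tones below 200 cycles" is an assumption.

```python
def waves_in_tone(frequency_cycles, duration_sec):
    """Number of complete sound waves contained in a tone."""
    return frequency_cycles * duration_sec


def duration_for_waves(frequency_cycles, n_waves):
    """Duration in seconds needed for a tone to contain n_waves waves."""
    return n_waves / frequency_cycles


print(duration_for_waves(1000.0, 12))    # about 12 waves needed at 1000 cycles
print(duration_for_waves(10000.0, 250))  # about 250 waves needed at 10,000 cycles
print(duration_for_waves(100.0, 3), duration_for_waves(100.0, 4))  # 3 to 4 waves at a low tone
```

On these figures the required time is about 0.012 sec at 1000 cycles and about 0.025 sec at 10,000 cycles, while a 100-cycle tone needs 0.03 to 0.04 sec, so that the absolute time is indeed smallest in the middle range.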

Now, the points in Fig. 37 show the time a tone must last in
order for its pitch to be perceived, only provided we accept a
certain criterion for what we mean by perception. The criterion
used in this case was arbitrary, but acceptable for the purpose
in hand. From another point of view, the proper measure of the
presence or absence of pitch in a tone is the precision with
which the pitch can be identified. Since our measure of precision
in sensation is the DL, we might inquire into the effect of
duration on the DL for frequency discrimination. Data in this
field are scant, but Bekesy (4) presents the results, for two
observers, as shown in Fig. 38. These curves demonstrate clearly
that the loss of pitch occurs gradually as the duration of the
tone is shortened. Some pitch would remain, according to this
criterion, just as long as the DL is small enough to be
measurable.

Fig. 38. The dependence of the relative difference limen (ΔF/F)
upon the duration of a tone (800 cycles). The curves are for two
observers. (After Bekesy, 4.)

At this point, we might profitably turn to an analysis of what a
short tone actually consists, with regard to its sound spectrum.
We have already seen the advantage of the type of analysis which
shows us that, whenever we alter the frequency of a tone for the
purpose of measuring DL’s, we introduce additional frequencies
and create thereby a complex spectrum. The same relation is true
of the mere turning on and off of a tone: we complicate its
structure by so doing. A single pure tone sounding throughout
eternity would have a solitary frequency in its spectrum, but a
tone lasting for any finite time has an infinite number of
frequencies in its spectrum. Fortunately, all these frequencies
do not have the same amplitude. Only when the sound lasts an
infinitely short time is the spectrum uniform throughout, and its
frequency, therefore, completely indeterminate. As the duration
is increased, the band of the spectrum containing most of the
sound energy narrows in width, and when, as we have seen, the
duration is infinite, the spectral band is infinitely narrow.
These notions are inherent in the principle of uncertainty as it
has been developed in physics, and they apply to any periodic
phenomenon. From the principle of uncertainty we conclude that
the accuracy with which the frequency of a sound can be
determined is proportional to its duration. Stated symbolically,
the principle is


\[ \Delta f \, \Delta t = 1 \]



where Δf is a measure of the width of the main peak of the
spectrum, in cycles, and Δt is the duration of the tone (Stewart,
4). As an example of the spectral distribution of the amplitudes
of the component frequencies in an 800-cycle tone whose duration
is but a single period (Δt = 1/800 sec), consider Fig. 39 (Burck,
Kotowski, and Lichte, 5). The maximal amplitude occurs at 800
cycles, and the width of the main band, at an amplitude equal to
one half the maximal amplitude (44 db), is also 800 cycles
(= 1/Δt). Other continuous bands containing higher frequencies
occur at intervals of 800 cycles, but their amplitudes fall off
with frequency as shown by the dotted curve.

Fig. 39. The spectral distribution of the energy in a wave
consisting of a single cycle out of an 800-cycle tone. The wave
lasts 1/800 sec and the energy is distributed in continuous bands
spaced apart by 800 cycles. (After Burck, Kotowski, and Lichte,
5.)
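
The reciprocal relation between duration and spectral width can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the calculation behind Fig. 39: it takes a 1000-cycle tone switched on and off abruptly, samples it at an assumed rate of 20,000 per second, and measures the width of the main band at half the amplitude found at the tone's own frequency. All names and parameter values are hypothetical.

```python
import cmath
import math

SAMPLE_RATE = 20000  # samples per second; an assumed value for this sketch


def spectrum_amplitude(tone_cycles, duration_sec, probe_cycles):
    """Amplitude at probe_cycles in the spectrum of an abruptly gated sine tone."""
    n_samples = int(round(duration_sec * SAMPLE_RATE))
    total = 0j
    for k in range(n_samples):
        t = k / SAMPLE_RATE
        total += math.sin(2 * math.pi * tone_cycles * t) * cmath.exp(
            -2j * math.pi * probe_cycles * t
        )
    return abs(total) / SAMPLE_RATE


def half_amplitude_width(tone_cycles, duration_sec, step_cycles=1.0):
    """Width of the main band at half the amplitude found at the tone frequency."""
    half = 0.5 * spectrum_amplitude(tone_cycles, duration_sec, tone_cycles)
    low = tone_cycles
    while spectrum_amplitude(tone_cycles, duration_sec, low) > half:
        low -= step_cycles
    high = tone_cycles
    while spectrum_amplitude(tone_cycles, duration_sec, high) > half:
        high += step_cycles
    return high - low


for duration_sec in (0.01, 0.02, 0.04):
    width = half_amplitude_width(1000.0, duration_sec)
    print(duration_sec, width, width * duration_sec)
```

Each doubling of the duration roughly halves the width, so that the product of width and duration stays close to unity. The exact constant depends on how the width of the band is defined, which is why the relation is best read as fixing an order of magnitude.
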

Now, when the ear is required to ascertain the pitch of a very
short tone, it must resolve a continuous spectrum whose effective
width is inversely proportional to the duration of the tone.
Obviously, neither the ear, nor any other frequency-analyzer, can
assign a pitch to a continuous spectrum, except to say that it
lies somewhere within the spectrum. We can, however, refer to the
curve in Fig. 37 and calculate the form of the spectrum which
just yields a sensation having a definite pitch, according to the
criterion used. Such calculations show that the pitch of a tone
is detectable when 70 per cent of the energy in the spectrum lies
within ±5 per cent of the principal frequency (Burck, Kotowski,
and Lichte, 7). But, if we adopt a more liberal criterion for
deciding whether a brief tone has pitch, the spectral
distribution of frequencies may be much wider without making the
pitch completely indeterminate.


THE THRESHOLD OF SUCCESSIVENESS 

One additional fact relating to the duration of tones should be
mentioned here. Experiments have been carried out to determine by
how much the onsets of two tones must be separated in time in
order for them to appear successive rather than simultaneous
(Strecker, Burck, Kotowski, and Lichte, 3). The practical problem
arises in telephone communication. When two different frequencies
are sent over a long transmission line, one of them may be
retarded more than the other. Obviously, if the frequencies in a
speech wave are retarded unequally by large enough amounts,
intelligibility may suffer. Hence, it is important to know within
what limits the ear will tolerate successiveness in the onsets of
two sounds.

Fig. 40. The just detectable temporal separation in the onsets of
two tones (F1 and F2). If the time interval for a given average
frequency is shorter than the value on the ordinate, only a
single onset is experienced; if the interval is longer, one hears
two successive onsets. (After Burck, Kotowski, and Lichte, 3.)

The curve of Fig. 40 shows the just-detectable difference in time
of onset for two tones of different frequency. The two tones were
chosen so that the relation between their frequencies was always
F2 = 1.1 F1. The average of the two frequencies is plotted along
the abscissa.
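
For any point on the abscissa, the pair of tones used can be recovered from the average frequency. The short sketch below assumes that the ratio between the two tones is 1.1, as read from the text; the function name is hypothetical.

```python
FREQUENCY_RATIO = 1.1  # F2 / F1, as read from the text (assumed)


def tone_pair_from_average(average_cycles, ratio=FREQUENCY_RATIO):
    """Return (F1, F2) such that F2 = ratio * F1 and (F1 + F2) / 2 = average."""
    f1 = 2.0 * average_cycles / (1.0 + ratio)
    return f1, ratio * f1


f1, f2 = tone_pair_from_average(1050.0)
print(round(f1, 3), round(f2, 3))
```

An average frequency of 1050 cycles thus stands for a pair of tones close to 1000 and 1100 cycles.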

The chief item of interest in Fig. 40, according to Burck,
Kotowski, and Lichte, is that the times involved are rather
similar to the time required for the recognition of the pitch of
a tone. In other words, if the first tone sounds long enough for
its pitch to become established in the ear of the listener before
the second tone begins, the tones will appear successive. If the
time between the tones is insufficient to establish the pitch of
the first before the second tone arrives, the onsets will appear
simultaneous.

Or, we might consider another, but related, aspect of the
phenomenon. Common experience with electrically generated tones
demonstrates that, whenever a tone is switched on suddenly, a
sharp click is heard. The same is true when a tone ends abruptly,
and the effect is usually not due to faulty switching, but to the
generation of transient vibration. Now, if the transients due to
the onset of the first tone overlap sufficiently those arising
from the second tone, the tones will seem to begin simultaneously
and to be accompanied by a single click. In order to achieve
succession, the two clicks must not overlap enough to fuse into a
single click.

Of course, if the tones are turned on gradually, rather than
abruptly, the threshold of succession is raised, as we might
expect, and a longer separation is required to avoid the
experience of simultaneous onset. Here the beginning of the tones
is not so sharply defined as when the transient click is fully
developed.





ABSOLUTE PITCH 

Many conflicting claims are made regarding the ability of certain
persons to name precisely the pitch of a musical note without the
aid of a standard of reference. Some people claim to recognize
middle C, for example, at any time and any place. The evidence
for this ability is mostly at the anecdotal level, however,
because systematic controls are difficult. What we should like to
mean by absolute pitch is the ability to name the pitch (or the
frequency) of a pure tone without the aid of such devices as
whistling or humming the note, i.e., without the aid of the
kinesthetic cues involved in the reproduction of the tone. We
should test the observer with pure tones, because complex tones,
such as those produced by musical instruments, have distinctive
qualities which may be readily recognized. Thus, for many
musicians, absolute pitch reduces to an ability to name notes
played on one particular instrument.

Nevertheless, some persons undoubtedly possess a genuine ability
to recognize pitch which far exceeds the ability of the average
individual. In fact, a most striking feature of this elusive
gift, held so much in awe by the musician, seems to be its
extreme variation among different individuals. How extreme this
variation is among a large group of listeners, under controlled
conditions, would be interesting to determine. Perhaps people
differ less than has been supposed.

Several investigators have attacked the problem of the extent to
which relatively untrained observers are able to acquire absolute
pitch. A recent study (Wedell) has confirmed previous
demonstrations that relatively unmusical observers can learn to
increase their accuracy in assigning pitch numbers to pure tones.
The greatest increase in ability takes place during the first few
practice sessions, after which an indefinite plateau is reached.
This plateau represented, for four observers, an average error
about one half as large as that present at the beginning of the
experiment. Apparently, in learning to recognize the pitch of
tones in a given range, the observers do not learn individual
notes, but rather they build up a more or less cohesive
subjective scale in terms of which they judge the placement of
notes within the range. Hence, in this type of absolute pitch,
the judgment is, at least in part, one of relation.

Unlike the case of hue in vision, the pitch scale is not marked
by critical points to which the usual observer can anchor his
judgments. The primary hues provide more or less sharply defined
points in the visual spectrum, which are readily recognized
(Westphal), but which have no counterpart in the auditory
spectrum. Pure spectral green marks a definite transition between
colors which are yellowish and colors which are bluish. Middle C,
however, is one tone in a homogeneous continuum unmarked by
qualitative turning points. It is perhaps for this reason that
absolute pitch must be regarded as a rare gift.

Many musicians, however, do not agree that the musical scale is
completely lacking in such qualitative turning points as
characterize the scale of hues. Bachem insists that there is
about the note C a certain “tone-chroma,” a certain “C-ness,”
which is the same for the note C in all octaves, and which is
unlike the chroma of the note D. This aspect of tones is
presumably the same as what has sometimes been referred to as
tonality, and its recognition is claimed by some to be the cue to
absolute pitch.

Bachem studied ninety cases of “genuine absolute pitch.” He
reports that seven persons in this group “possessed infallible
absolute pitch over the whole scale of the piano and for all
musical instruments and physical apparatus with which they were
tested.” Not even errors of half tones or octaves were observed.
The judgment was immediate and definite, and possessed a high
degree of subjective certainty. These people claimed to base
their decision upon the immediate perception of tone-chroma, and
to rely upon a recognition of the “height” of the tone for the
identification of the particular octave to which the tone
belonged.

Forty-four persons had infallible absolute pitch, provided we
neglect the following types of errors:

1. Confusion between octaves.

2. Constant errors of a half tone in one direction. (These errors
may be due to the fact that there are several standards of pitch
in use, i.e., A is not always taken as 440 cycles.)

3. Errors of a half tone downward in the highest part of the
musical scale, and of a half tone upward in the lowest part.
(These errors are so common that they all but represent the
rule.)

In addition, eight persons had good absolute pitch over the
limited range of three to four octaves. This type may be called
“regional” absolute pitch. Five showed excellent ability on
certain instruments, but not on others. Seven could identify the
pitch only within a limited range on certain instruments.
Absolute pitch in these cases appears limited to the recognition
of timbre. The remaining nineteen persons showed fair ability in
the identification of pitch, but made many errors and were slow
and indecisive in their judgments.
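
The groups reported above can be tallied to confirm that together they account for all ninety cases. The labels in the sketch are paraphrases introduced here for convenience; only the counts come from the text.

```python
# Counts as reported in the text; the labels are paraphrases, not Bachem's terms.
bachem_groups = {
    "infallible over the whole scale and all instruments": 7,
    "infallible apart from the three listed types of error": 44,
    "regional, over three to four octaves": 8,
    "excellent on certain instruments only": 5,
    "limited range on certain instruments": 7,
    "fair ability, with many errors": 19,
}

total_cases = sum(bachem_groups.values())
print(total_cases)
```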

Some people possess what Bachem calls “quasi-absolute pitch.”
They know, for example, the lowest note they can sing, and, by
applying a knowledge of musical intervals, they can estimate the
pitch of another tone to within one or two semitones. The
judgments are accompanied by much singing and humming, and
usually take much more time than the judgments of those
possessing “genuine absolute pitch.” The recognition of pitch is
also called quasi-absolute when the person relies upon a memory
of a specific note with which he is very familiar, and tries to
estimate the interval between this note and the unknown tone.



CHAPTER 4 


LOUDNESS 

Just as we found it necessary to distinguish between pitch and
frequency, so must we discriminate sharply between the meaning of
the concepts loudness and intensity. We use the word intensity to
mean the magnitude of a sound as measured with the aid of
instruments and expressed in terms of energy or pressure.
Loudness refers to an aspect of the sensation obtained by
listening directly to a sound. We measure loudness by means of
the discriminatory responses of a normal human observer.

Although loudness and intensity bear no simple relation to each
other, the acoustical literature abounds with statements implying
their synonymy. Loudness is not our perception of intensity, and
the decibel, contrary to what is sometimes assumed, is not a unit
of loudness. The decibel is a stimulus unit expressing the
relation between two intensities. Thus confusion arises when
experimenters express their measurements in decibels, but fail to
indicate to what the decibels refer. A decibel is a measure of
the ratio between two physical quantities (see Glossary) and is,
therefore, ambiguous unless one of the quantities is stated
explicitly. The statement that a sound has an intensity of 50 db
has meaning only when we know to what the 50 db are related,
i.e., what zero db is. (See Appendix III for a table of
decibels.)

The following are the most common scales for expressing the
intensity of a stimulating sound:

1. Intensity-level indicates the number of decibels that the
intensity of a free progressive sound wave is above the arbitrary
reference intensity, where

zero db = 10^-16 watt per square centimeter

zero db = 0.0002 dyne per square centimeter

zero db = 73.8 db below 1 dyne per square centimeter

2. Sensation-level indicates the number of decibels that a sound
is above the threshold of hearing at that frequency.
Sensation-level can be translated into intensity-level provided
we know the intensity of the sound at threshold, or provided the
observer’s hearing is normal and his threshold comparable to the
curves of Fig. 17. Sensation-level provides a convenient scale
for expressing the results of experiments in which it is
impracticable to measure the absolute intensity of the sound, but
in which the reduction in intensity necessary to reach the
observer’s threshold can be determined.

3. Loudness-level for a given tone is defined as the
intensity-level of a 1000-cycle tone which sounds equal in
loudness to the given tone. For a tone of 1000 cycles, called the
reference tone, intensity-level and loudness-level are
equivalent. For other tones, loudness-level may be determined by
the procedure outlined below (see p. 123). It has been proposed
(Fletcher and Munson, 1) that the reference tone be defined as a
plane or spherical sound wave having only a single frequency of
1000 cycles and listened to by an observer facing the source. The
intensity-level would then be the number of decibels that this
sound is above the reference intensity, which must be determined
for the sound field at the position where the listener’s head is
to be placed.* There is some possibility that the German word
phon will come to be the accepted name for the unit of
loudness-level. The definition of a phon is mathematically the
same as that of a decibel, and, hence, whenever we refer to
loudness-level we may substitute the word phon for the word
decibel.

* This definition of loudness-level presupposes a free
progressive sound wave to which the observer listens with both
ears. Actually, the determinations of loudness-level available at
present were made with receivers on the ears, and the intensity
of the 1000-cycle reference tone was stated in terms of its
sensation-level (see Fig. 44). This procedure is the more
practical. Hence, most often when we shall have occasion to refer
to loudness-level we shall mean the loudness of a 1000-cycle tone
whose intensity is a certain number of decibels above its
threshold, as determined under the actual conditions of
listening. Until certain experimental discrepancies between field
and pressure measurements have been resolved, we shall have to
content ourselves with this inconsistency between the formal and
the practical definition of loudness-level. (See p. 125.)
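
The scales listed above can be sketched as short conversions. This is a minimal illustration which assumes reference values of 10^-16 watt and 0.0002 dyne per square centimeter; the function names, and the threshold of 10 db used in the last example, are hypothetical.

```python
import math

REFERENCE_INTENSITY = 1e-16   # watt per square centimeter (assumed reading of the text)
REFERENCE_PRESSURE = 0.0002   # dyne per square centimeter


def intensity_level_db(intensity_watt_per_cm2):
    """Decibels above the reference intensity."""
    return 10.0 * math.log10(intensity_watt_per_cm2 / REFERENCE_INTENSITY)


def pressure_level_db(pressure_dyne_per_cm2):
    """Decibels above the reference pressure (pressure ratios take 20 log10)."""
    return 20.0 * math.log10(pressure_dyne_per_cm2 / REFERENCE_PRESSURE)


def intensity_level_from_sensation_level(sensation_level_db, threshold_intensity_level_db):
    """Translate sensation-level into intensity-level, given the threshold."""
    return sensation_level_db + threshold_intensity_level_db


print(round(intensity_level_db(1e-11), 2))   # a sound 100,000 times the reference intensity
print(round(pressure_level_db(1.0), 2))      # 1 dyne per square centimeter
print(intensity_level_from_sensation_level(40.0, 10.0))  # hypothetical threshold at 10 db
```

With a reference pressure of exactly 0.0002 dyne per square centimeter, 1 dyne per square centimeter falls at about 74 db, close to the 73.8 db quoted above; the small difference presumably reflects rounding of the reference pressure.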





None of these three scales is a loudness scale. Each is a measure
of the intensity of the stimulus relative to some arbitrary
physical standard. The establishment of a numerical scale to
represent the psychological magnitude, loudness, provides a
problem similar to that discussed in connection with the pitch
scale (see Chapter 3).

CRITERIA FOR A LOUDNESS SCALE 

In creating a loudness scale we should like to satisfy two
conditions. First, our scale should be applied to the attribute
of sensation in such a way that the numbers on the scale have
true numerical significance, which means, simply, that, if the
numbers are manipulated according to the rules of arithmetic,
the result (and the manipulations) correspond to a set of physical
operations. Second, our scale should bear a reasonable
relation to the experience of the observer. Thus, the scale
would be satisfactory if the magnitude of the attribute of sensation
to which the number 10 is assigned should appear to be
half as great to the listener as that to which the number 20 is
given, and twice as great as the magnitude to which the number
5 is given.

A scale, then, which would enable us to designate the
numerical relation between magnitudes of the attribute loudness
can be constructed by assigning some number N to a given
magnitude, and the number N/2 to the magnitude which appears
half as great to the experiencing individual. Obviously,
in the application of this criterion we are limited by our ability
to devise operations for the determination of fractional magnitudes
of sensation. Three general methods have been used to
discover the intensity at which one tone sounds half as loud as
another tone.

FRACTIONATION OF LOUDNESS 

1. The observer is required to make a direct estimate of the
fractional relation between two tones sounded successively.
Several variations of this procedure are possible, and several
experimenters have contributed data on the fractionation of
loudness.

The work of Richardson and Ross appears to be the earliest
published. Their observer heard tones of two different intensities
and was required to rate one of the tones as a certain
fraction or multiple of the other. Unfortunately, the intensities
of the stimuli were not reported in terms of acoustical
quantities, so that comparison with later results is difficult.
Nevertheless, Churcher, by making reasonable assumptions, was
able to demonstrate that the data of Richardson and Ross are
in good agreement with more recent results. These authors
cite Stumpf’s objection that “one sensation cannot be a multiple
of another. Every sensation presents itself as an indivisible
unit.” Although this assertion may be true, it does not
follow that we cannot establish scales of loudness, for loudness
must be regarded as a measurable aspect of sensation. Sensations
themselves cannot be divided, but the numbers representing
the magnitude of one of their aspects can.

Fig. 41. The ordinate shows the intensity at which a tone (T1) sounds
half as loud (open figures) or a tenth as loud (solid figures) as another tone
(T2) whose intensity is given by the abscissa.

Ham and Parkinson produced tones with a loud speaker and
asked observers to estimate the fractional reduction in loudness
attendant upon a known reduction in the intensity of the
stimulus. From their results it is possible to determine the
intensity at which one tone sounds half as loud as another.
These results for a tone of 1000 cycles are shown in Fig. 41,
where we find the intensity of a tone T1 which appears to be
half as loud as the tone T2 whose intensity is given by the
abscissa.

In the experiments of Geiger and Firestone the test tone was
presented to the observer by means of telephone receivers applied
to both ears, and he was required to change the intensity
of a second tone until its loudness was a certain ratio of that of
the test tone. The ratios employed were 0.01, 0.1, 0.25, 0.5, 1, 2, 4,
10, and 100. Frequencies of 1000 and 60 cycles and a complex
tone were tried. The results for the 1000-cycle tone, in
which the second tone was set to half the loudness value of the
test tone, are also shown in Fig. 41. These experimenters
found that the order of presentation of the tones to be compared
may influence the results.

The experiments of Churcher, King, and Davies were carried
out mainly with pure tones of 800 cycles. Their observers
were asked to set a second tone to a value of loudness equal to
one half and to one fourth of the loudness of a standard tone.
These results are also shown in Fig. 41. Very nearly the same
result was obtained from two successive halvings as from a
single quartering.

It is clear from Fig. 41 that satisfactory agreement can be
obtained from direct subjective estimates of loudness. This
method is perhaps the most fundamental
(although not necessarily the most reliable) one under the
criteria previously laid down for the nature of the loudness
scale. The other methods are also valid in so far as they offer
alternative ways of getting the same results.

2. An alternative method, offering the possibility of greater
reliability, makes use of the fact that the two ears are connected
in such a way that a tone introduced into one ear sounds half as
loud as the same tone introduced into both ears. The procedure,
then, is to have the observer adjust the intensity of a
tone in one ear until it sounds as loud as a given tone in both
ears.

Such monaural-binaural equations have been carried out by
Fletcher and Munson (1), whose results are presented in Fig. 41.
It is only because of the agreement between these data and those
procured by direct estimate that we are able to conclude that
loudness sums in the two ears.

3. Another method, proposed by Fletcher (4), is based on
the fact that two tones of equal loudness, which are sufficiently
separated in frequency as not to stimulate overlapping areas
on the basilar membrane, yield, when presented together, a
loudness twice as great as either one alone. By equating a third
tone first to one and then to both, we should obtain the ratio of
intensities corresponding to a ratio of two to one in loudness.
Here again validation of the method depends upon its ability
to produce results comparable to those obtained by direct estimate.
That such agreement is forthcoming can be seen from
the data for two-component tones in Fig. 41.

If the tones introduced into the same ear are too near in
frequency, they stimulate overlapping areas of the basilar membrane,
whereupon some degree of masking may occur and may
interfere with the summation of the two loudnesses. Strikingly
different, however, is the effect when the two tones are led to
each ear separately. In this case, summation occurs, but only
when the frequencies are close together. Thus Fig. 42 (Békésy,
4) shows the relative intensity (sound pressure) of a third tone
sounding in one ear and equated to the combined loudness of
two tones sounding simultaneously, one in each ear. Since the
two tones had an intensity 40 db above threshold, the results
for the case in which they were of the same frequency (zero on
the abscissa) check with the data of Fig. 41. When the two
tones are of different frequency, and are led to the two ears
separately, loudness does not sum. It appears that, in order for
loudness to sum arithmetically in one ear, the tones must be
far apart in frequency; for it to sum in two ears separately, the
tones must be identical in frequency. Of course, if the tones
in one ear are identical in frequency and phase, their intensities
must sum. In fact, Békésy (2) has presented a curve similar to
Fig. 42 obtained with one ear. (This result of Békésy’s is in
apparent conflict with Fletcher’s finding.)

Fig. 42. Showing the relative intensity (sound pressure) of a comparison
tone equated in loudness to a pair of tones presented dichotically (one in each
ear). Here we see that the loudness due to a tone F increases when another
tone, F + ΔF, is led to the other ear, although the increase is not the same for
all values of ΔF. The sensation level of F and of F + ΔF was 40 db.
(After Békésy, 4.)

Related to the problem of summation in the two ears is the
observation (Stevens and Sobel) that, in the case of binaural
beats (see p. 172), the apparent loudness is greatest at the instant
when the tones in the two ears are exactly in phase.

Ratios of loudness other than two-to-one can be established
by varying the number of equally loud components, provided
they can be kept far enough apart in frequency to prevent
masking. Data for a tone of ten components (Fletcher and
Munson, 1) are shown in Fig. 41. The tone consisted of ten
harmonic frequencies with a fundamental of 530 cycles. Each
component was generated by an optic siren, and its intensity
was adjusted so that its loudness was equal to that of a 1000-cycle
tone whose intensity is shown by the ordinate of Fig. 41. The
1000-cycle tone was then equated in loudness to the ten-component
tone, at which point its intensity was as shown by the
abscissa. These results should be compared to those obtained
from observers who made direct estimates of the ratios at which
two intensities gave a tenfold difference in loudness (see Fig.
41). As we should expect, the agreement is less than was found
for the determination of a twofold difference, but, in view of
the nature of the judgment, Fletcher thinks it remarkable that
the two kinds of data determine a single curve, within what
he terms “observational error.” The consistency between the
data for twofold and for tenfold fractionation is discussed below.

THE LOUDNESS FUNCTION 
From the data in Fig. 41 we can proceed graphically to
define an intensitive function satisfying the criteria laid down
for an acceptable loudness scale (compare the procedure for
erecting a pitch scale, Chapter 3). This function is the one
whose value at any given intensity is proportional to the subjective
loudness produced by a tone of that intensity. First,
we fit a curve to the data obtained by the ‘halving procedures,’
giving special weight to the points determined by the monaural-binaural
method. Then we assign the arbitrary number 1 to
the intensity of 40 db above threshold and read on the ordinate
scale the intensity of the tone which sounds half as loud, and
which, therefore, receives the number 0.5. After repeating
this procedure both above and below our starting point at
40 db, we can plot the function for the 1000-cycle tone, as shown
in Fig. 43. This function, then, satisfies the criterion that any
value N stands for a tone which appears to a normal observer
half as loud as that represented by the number 2N, at least
within the present limits of experimental error. Had we used,
in an analogous manner, the curve in Fig. 41 representing a
tenfold reduction, we should have obtained a very similar function.
In fact, the striking agreement (see Fletcher, 4) between
the functions derived from the twofold and from the tenfold
fractionations probably justifies a high degree of confidence in
our ability to establish a meaningful loudness scale.
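
The graphical construction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `half_loudness_level` is a hypothetical stand-in for the curve fitted to the halving data of Fig. 41 (the constant 10-db step is invented for the example and is not read from the figure), and the helper names are not taken from the text.

```python
def half_loudness_level(level_db):
    """Hypothetical stand-in for the fitted halving curve of Fig. 41.

    Returns the level (db above threshold) of the tone judged half as loud.
    The constant 10-db step is an assumption made for illustration only.
    """
    return level_db - 10.0


def invert(halving, target_db, lo=0.0, hi=140.0, tol=1e-6):
    """Find, by bisection, the level whose half-loudness level is target_db.

    Assumes the halving curve rises steadily with level.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if halving(mid) < target_db:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def build_loudness_scale(halving, start_db=40.0, start_value=1.0, steps=2):
    """Assign numbers to levels by repeated halving and doubling."""
    points = [(start_db, start_value)]
    level, value = start_db, start_value
    for _ in range(steps):  # downward: each step halves the number
        level, value = halving(level), value / 2.0
        points.append((level, value))
    level, value = start_db, start_value
    for _ in range(steps):  # upward: each step doubles the number
        level, value = invert(halving, level), value * 2.0
        points.append((level, value))
    return sorted(points)


for level, value in build_loudness_scale(half_loudness_level):
    print(f"{level:6.1f} db -> {value:g}")
```

Any pair of neighbouring points satisfies the criterion that the value N belongs to the tone judged half as loud as the tone assigned 2N. Substituting the measured halving curve for the hypothetical one would reproduce the procedure in the text.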



LOUDNESS IN SONES 





The loudness of a 1000-cycle tone 40 db above threshold,
listened to with both ears, recommends itself as the logical
unit, because 1000 cycles has been selected as the reference
frequency for loudness comparisons leading to the determination
of loudness-level, and the loudness-level of 40 db above
threshold has been proposed as the reference level for determining
the pitch of a tone (see p. 76). Such a unit should prove
to be of the right order of magnitude for general usefulness,
since it is only about one third of 1 per cent of the maximal
loudness the ear can support. As we shall see later (Fig. 62,
p. 151), this unit corresponds in order of magnitude to the
differential thresholds of moderately intense tones of the musical
scale. As a name for the unit the word sone has been
proposed (Stevens, 7).

Although an empirical formula relating the loudness of
the 1000-cycle reference tone to the intensity of the stimulus
would be a great convenience, the function of Fig. 43 does not
lend itself to simple mathematical expression. On the assumption
that the loudness function can be represented by a straight
line for intensities above 40 db (Fletcher, 4), and that for low
intensities loudness is proportional to the intensity, an equation
can be written (Knauss), as follows:

$$L = I\,\left(10^{-5/2}\,I + 1\right)^{-2/3} \times 10^{-3}\ \text{sones}$$

where $I$ represents intensity in units of $10^{-16}$ watt per square
centimeter. The difference between the values given by this
equation and the function in Fig. 43 reaches 50 per cent at 10
and 120 db, but between these values the difference is less. Still,
for accurate computations, it is probably better to rely on a
table or a graph of the loudness function rather than to use an
equation.
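
As a rough check on the empirical equation, it can be evaluated numerically. The sketch below assumes that the equation reads L = I (10^(-5/2) I + 1)^(-2/3) × 10^(-3) sones, with I in units of 10^(-16) watt per square centimeter; this reading is a reconstruction from a damaged printing, and the function name is hypothetical.

```python
def knauss_loudness_sones(level_db):
    """Loudness in sones of the 1000-cycle tone, from the assumed equation.

    level_db is the intensity level in decibels above 1e-16 watt per square
    centimeter. The form of the equation is an assumption (a reconstruction).
    """
    intensity = 10.0 ** (level_db / 10.0)  # I in units of the reference intensity
    return intensity * (10.0 ** -2.5 * intensity + 1.0) ** (-2.0 / 3.0) * 1e-3


for db in (0, 20, 40, 60, 80, 100):
    print(f"{db:3d} db -> {knauss_loudness_sones(db):.4g} sones")
```

Under this reading the equation gives a little under 1 sone at 40 db and is nearly proportional to intensity at the lowest levels, which agrees with the two assumptions stated in the text. For accurate work the table or graph remains preferable, as the text advises.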

The curve for 1000 cycles in Fig. 43 is quite accurate for all
tones between about 700 and 4000 cycles. At other frequencies,
loudness is a somewhat different function of intensity, as shown
by the other curves of Fig. 43. These curves were determined
by equating the respective tones to the 1000-cycle tone in loudness
(see below), and then assigning loudness values on the basis of
these loudness equations. It should be possible, of course, to
carry out the fractionation procedure for each frequency in
turn and obtain curves identical to those of Fig. 43. For the few
cases in which data are available, reasonable agreement can be
demonstrated. In general, the lower the frequency the more
rapidly does loudness grow as a function of intensity, at least for
intensities below the 100-db level. Thus a tenfold (20-db)
increase in the intensity of tones whose loudness is 0.1 sone produces,
in a 50-cycle tone, a two-hundredfold increase in loudness,
but only an elevenfold increase in a 1000-cycle tone. At intensities
above 100 db, however, all frequencies below approximately
50 cycles increase less rapidly than the 1000-cycle tone.

The curve for 10 cycles was obtained from an experiment by
Békésy (22), in which he equated a 5-cycle and a 10-cycle tone
to a 50-cycle tone in loudness (see Fig. 46). Békésy reports
that the loudness of these very low tones increases to a maximum
and then declines at higher intensities. The threshold
of feeling is reached before the maximum is attained, but apparently
Békésy’s observers were able to make loudness judgments
at intensities above the threshold of feeling.

The curves of Fig. 43 contain important implications for the
problem of the reproduction of music by electrical devices. The
original quality of a selection can be maintained by a system
which reproduces all frequencies equally effectively only when
the intensity level of the reproduction is the same as the intensity
level of the original. Thus, at the proper intensity, a radio
transmitter and receiver of perfectly uniform frequency response
may reproduce a musical rendition whose component
frequencies are all of the same loudness for the listener as they
would be in the broadcasting studio, but at weaker intensities
the selection would appear to have lost its low-frequency tones.
Hence, efforts to conserve the relative loudness of the low frequencies
are ineffective unless account is taken of the intensity
level at which the reproduction is to be made.

OTHER METHODS OF ESTIMATING LOUDNESS 

The loudness scale in Fig. 43 was constructed from data obtained
by the methods of fractionation. Can
this loudness scale be verified by other methods, such as the
method of bisection and the method of equi-distances? Such
verification is theoretically possible; in fact, it is theoretically
required if the loudness scale is valid. The ability of any two
methods to confirm each other is conditioned, among other
things, upon our ability to eliminate constant errors in the
experimental procedures.

The Method of Bisection. Under this method the observer
is required to set a variable tone until it is midway between two
fixed tones in loudness. Objection has been made by Gage (1)
to this method on grounds of internal inconsistency. His observers
first bisected a loudness interval to obtain the halfway
point. Then they bisected the upper half to obtain the three-quarter
point and the lower half to obtain the one-quarter point, and
finally they bisected the distance between the one-quarter and
the three-quarter points to yield a second halfway point. This
second halfway point should be identical with the first halfway
point. Instead, it was consistently higher (louder) than
the original bisection.

A repetition of this experiment, under slightly different conditions
(Newman, Volkmann, and Stevens), gave results which,
unlike Gage’s, showed internal consistency. These results confirm
the demonstration by Wolff that consistency is possible,
and suggest that, when there is a lack of consistency in the
results of bisection, we should look for constant errors in the
procedure. Clearly, if in the method used by Gage there had
been a slight positive error in each bisection, the cumulative
effect would be large enough to account for the discrepancy
observed.

When the results of experiments on bisection are compared
with our expectations based on the loudness scale, the situation
becomes equivocal. Most of the bisections of short intervals
disclose excellent agreement, but bisections of long intervals
fail disconcertingly. In the case of these long intervals we are
faced with the paradox that, when an observer sets a tone to a
loudness one half that of a given tone, he does not do the same
thing as when he sets a tone to a loudness halfway between
that of the same given tone and a tone whose loudness approaches
zero (Wolff). The bisection always gives a value
lower than the fractionation, when a long interval is involved.
One of Gage’s observers, for example, bisected the interval from
0.1 to 6.0 sones and obtained the value of 0.8 sone. Setting a
tone to half the loudness of 6.0 sones would, of course, result in
a loudness of 3.0 sones. What could account for this discrepancy
of almost fourfold?

A possibility is that the observer assumes two very different
attitudes under the two conditions. Preliminary experiments
(Stevens and Volkmann) have demonstrated that, in approaching
the problem of bisecting a loudness interval, one can aim
either at setting the middle tone halfway between the other two
or at an adjustment such that the ratio of the middle tone to
the lowest tone equals the ratio of the highest tone to the
middle tone. In other words, one can aim either at the arithmetic
or at the geometric mean. Different results are obtained
by observers having these two attitudes. Thus a bisection of
the interval from 5 to 20 sones would yield 12.5 sones under the
first attitude and 10 sones under the second. The inability of
observers to keep separate these two attitudes might possibly
account for both the direction and the magnitude of the discrepancies
in the experimental results.
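
The two attitudes can be put side by side in a short Python sketch. The function names are hypothetical; the figures of 5 and 20 sones are those of the example in the text, and the second call applies the same arithmetic to the interval from 0.1 to 6.0 sones cited from Gage.

```python
import math


def bisect_arithmetic(low_sones, high_sones):
    """Middle tone set halfway between the two fixed tones (equal distances)."""
    return (low_sones + high_sones) / 2.0


def bisect_geometric(low_sones, high_sones):
    """Middle tone set so that middle/low equals high/middle (equal ratios)."""
    return math.sqrt(low_sones * high_sones)


for low, high in ((5, 20), (0.1, 6.0)):
    print(
        f"{low} to {high} sones: "
        f"arithmetic {bisect_arithmetic(low, high):.2f}, "
        f"geometric {bisect_geometric(low, high):.2f}"
    )
```

For the long interval the geometric mean comes to about 0.77 sone, which lies close to the 0.8 sone reported for Gage’s observer. This comparison is simple arithmetic added here for illustration and is not a result claimed in the text.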

The Method of Equi-Distances. This method is similar
to that of bisection, except that the two intervals to be equated
do not have a point in common. An interval in one part of
the loudness scale is set to equal an interval in another part.
The intervals may or may not overlap.

Here again we are confronted with the possible source of
error that the observer may set the tones to show what he deems
to be either equal distances or equal ratios. When the intervals
are small and fairly close together, however, these two attitudes
may lead to indistinguishable results.

Wolff used this method to equate several intervals to three
different standard intervals whose magnitude was very nearly
7 sones in each case. In all but two of the fifteen cases reported
the adjustments were to within 1 sone of the size of the standard
intervals. Hence, it appears that in these cases the observers
aimed consistently at equal subjective distances, and their
ability to obtain the results predicted by the loudness function
(Fig. 43) offers interesting confirmation of the loudness scale
erected on the basis of fractionations. The results are not what
we should predict on the assumption that these observers were
setting the tones to produce equal ratios.

EQUAL LOUDNESS CONTOURS 

The preceding treatment of problems relating to a loudness
scale has regarded loudness as a function of intensity. With
frequency held constant (usually at 1000 cycles) we have seen
how loudness varies when intensity is altered. As already
indicated in Fig. 43, however, loudness also varies with frequency,
when intensity is held constant. The precise relation
between loudness and frequency can be discovered by mapping
what are called equal-loudness contours, that is to say, by determining
at what intensities tones of different frequencies appear
equal in loudness to a standard tone (1000 cycles) at
various intensities.

Fletcher and Munson (1) equated tones in an earphone to a
standard frequency of 1000 cycles (the accepted standard for
loudness comparisons) and obtained the results shown in
Fig. 44. Here the contours are plotted against an ordinate representing
sensation level (decibels above the average threshold
of the observers used in the experiment). The loudness-level in
phons is indicated by the number on each contour, and is
determined by the intensity of the 1000-cycle tone lying on the
contour. Thus all tones on the contour marked 50 sound
equal in loudness to a 1000-cycle tone 50 db above threshold.
These curves in Fig. 44 represent the best set of smooth curves
the experimenters could draw through the observed points.



pressure. The dotted curve at the top represents Wegel’s data for the threshold
of feeling. The parameter is designated as loudness level (first number)
and as loudness in sones (number in parentheses).

Now, often it is desirable to know the course of the equal-loudness
contours as a function of the sound pressure at the
eardrum of the listener. Thus, by plotting the functions of
Fig. 44 against an ordinate representing pressure level (decibels
above 0.0002 dyne per square centimeter), we obtain the curves
of Fig. 45. The lowest contour (zero) is the same as the threshold
curve for minimum audible pressure, shown in Fig. 17 (p.
50); the upper curve (dotted) represents the threshold of feeling,
as shown in Fig. 19 (p. 59). The contours in Fig. 45 are
numbered in two ways. The first number indicates the level,
in decibels, or phons, of the 1000-cycle tone above threshold,
and the second number (in parentheses) indicates the loudness,
in sones, of the tones represented by each contour. In other
words, the first number represents loudness level and the second
number loudness. The threshold of feeling corresponds approximately
to the loudness-level of 120, or to a loudness of
240 sones.

Instead of plotting the equal-loudness contours using the
threshold for minimum audible pressure as the reference contour,
as in Fig. 45, we could plot them on a similar grid using
as the threshold function the curve for the minimum audible
field, curve 2 in Fig. 17. We should then obtain what have
been proposed as the standard loudness contours of pure tones
under open-field conditions. But these contours are suspect.
The equal-loudness relations (Fig. 44) were determined, not
in an open sound field under standard conditions of listening,
but with a telephone receiver on the ear. Recent measurements
(Churcher and King) have disclosed a slight discrepancy between
equal-loudness contours obtained in a free field and
those in Fig. 44. Consequently, an extensive redetermination
of loudness relations, when the observer listens to a plane sound
wave in a free field, will have to be made before we can adopt
a set of standard contours representing loudness relations under
the standard condition of listening.

Another manner of plotting the relation between the three
variables (frequency, intensity level, and loudness level) for
the experiment in which sound intensity is measured directly
at the place where the head of the observer is to be (field intensity)
is shown in Fig. 46. Here loudness level is plotted as a
function of intensity level, with frequency as the parameter.
(These curves will need revision when more field measurements
are available.) The curves for 5 and 10 cycles were not taken
from the work of Fletcher and Munson, as were the other
curves, but from an experiment reported by Békésy (22), in
which the 5- and 10-cycle tones were each equated to a 50-cycle
tone in loudness. The dotted portion of these two curves represents
the behavior of loudness-level at intensities above the
threshold of feeling. Clearly, these curves pass through a
maximum. The curve for 30 cycles suggests that, if continued,
it would likewise reach a limiting value and perhaps decline.
These curves in Fig. 46 were used to determine the loudness
functions for the lower frequencies, as shown in Fig. 43.



Fig. 46. Showing how loudness level varies with intensity level at different
frequencies (parameters). (After Fletcher and Munson, 1, and Békésy, 22.)


In an effort to ascertain the reliability with which observers
can judge the relative loudness of tones of different frequency,
Steinberg and Munson worked out the distributions of loudness
judgments for a large group of people with normal hearing.
By the method of constant stimuli, two different tones, 100 and
5000 cycles, were equated in loudness to a 1000-cycle standard.
Under these conditions, 97 people equated 5000 and 1000 cycles
in loudness with a probable error of 5.2 db, and 98 people
equated 100 and 1000 cycles with a probable error of 6.4 db.
With the same groups of observers (all of them inexperienced
in auditory judgments) the threshold intensity of the 1000-cycle
tone was determined by a similar procedure, and the probable
error of the distribution of threshold values was 3.6 db.
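
Readers more used to standard deviations than to probable errors may find a conversion helpful. The sketch below assumes that the judgments are normally distributed (an assumption not stated in the text), in which case the probable error is 0.6745 times the standard deviation. It takes the three probable errors as 5.2, 6.4, and 3.6 db; the function name is hypothetical.

```python
# For a normal distribution, half of the observations fall within
# 0.6745 standard deviations of the mean (the "probable error").
PROBABLE_ERROR_FACTOR = 0.6745


def probable_error_to_sigma(probable_error_db):
    """Convert a probable error in db to a standard deviation in db."""
    return probable_error_db / PROBABLE_ERROR_FACTOR


cases = (
    ("5000 vs 1000 cycles", 5.2),
    ("100 vs 1000 cycles", 6.4),
    ("1000-cycle threshold", 3.6),
)
for label, pe in cases:
    sigma = probable_error_to_sigma(pe)
    print(f"{label}: probable error {pe} db, sigma about {sigma:.1f} db")
```

On this assumption the scatter of the loudness equations corresponds to standard deviations of roughly 8 to 9 db, against roughly 5 db for the threshold determinations.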

In order to account, if possible, for the rather wide distribution
of loudness judgments among these observers, several
factors have been investigated with the following results.

1. The effect of experience is unimportant. Experienced
observers differ from each other as much in judgments of loudness
as do inexperienced observers.

2. Variations in the judgments of a single observer at different
times cannot account for all the scatter among the group.
The deviations of repeated tests by a single observer tend to be
smaller, by about one half, than the deviations of a number of
observers making one test each.

3. For these people, whose hearing is essentially normal,
the loudness judgment is apparently not greatly dependent
upon the acuity of hearing.

4. Differences in the effective intensity of the stimulating
tones after they have reached the organ of Corti probably account
for only a small part of the deviations.

All these factors together probably account for some, but
certainly not for all, of the scatter among the loudness judgments
of normal listeners. The judgment itself, like all psychological
judgments, appears to be inherently variable.

RELATION BETWEEN LOUDNESS AND MASKING 

There is an interesting and important relation between the
loudness of a sound and its ability to mask other sounds
(Fletcher and Munson, 2). Masking is defined as the change
in the threshold of one tone due to the presence of another,
the masking tone. This change is measured in decibels (see
Chapter 8). Now, the fact that the threshold of a tone is raised
by the presence of a masking sound may be taken to indicate
that some of the receptors on the basilar membrane are already
excited by the masking sound. The amount that the tonal
threshold needs to be raised in order to override the effects of
the masking sound may be regarded as a measure of the extent
of nervous activity created by the masking sound. The amount
of this masking may be considered, furthermore, to represent
the contribution to loudness arising from that region of the
basilar membrane which corresponds to the frequency of the
masked tone. In other words, when we consider a small unit
of extent on the basilar membrane, masking, excitation, and
loudness are functions of one another. Then, if these elementary
assumptions are valid, it follows that the area under a
masking audiogram should be a definite function of the total
loudness of the masking sound.

We can express these relations mathematically as follows:

$$dL = F(M)\,dx$$

where dL is an element of loudness, F(M) is a function of the
masking and is the loudness per unit length, and dx is an element
of length on the basilar membrane (this element is taken
as 1 per cent of the length of the membrane).

Then, if we assume that each unit of length of membrane
contributes equally to the total loudness when the excitation
of all units is equal (as measured by masking), we can integrate:

$$L = \int F(M)\,dx$$

Now, if we could obtain a sound which would give a masking
audiogram having a constant value over a certain range of frequencies
and zero value for all other frequencies, we could
determine its loudness and its extent on the basilar membrane
(from the curve in Fig. 35, p. 96), and thereby solve the equation
above for the value of F(M). But no such sound can be
obtained. The frequencies in an acoustic spectrum can be
confined to a definite band, but the masking will always trail
off gradually, especially above the frequency band.

In order to circumvent this difficulty, Fletcher and Munson
used a wide band of thermal noise having a continuous spectrum
and capable of stimulating practically the entire length of the
basilar membrane. The intensity-profile of this sound-spectrum
was adjusted so as to mask equally tones of all frequencies.
Then, by measuring both the subjective loudness and the masking
produced by this noise, the authors were able to compute
the function, F(M), and obtain the curve shown in Fig. 47.



Fig. 47. Loudness as a function of masking. The ordinate gives the loudness,
F(M), contributed by a small unit of length (1 per cent or 0.3 mm) of
the basilar membrane when it is excited to such an extent that it would produce
a masking equal to the value M Example when a portion of the basilar 
membrane is excited by a sound which would raise the threshold of another 
sound, stimulating the same region, by 57 db, each unit (1 per cent) of the 
membrane m that region contributes a loudness of 10 3 millisones to the total 
loudness of the sensation (After Fletcher and Munson, 2 ) 


This curve gives the loudness, in millisones (1 millisone = 0.001
sone), resulting from uniform excitation of 1 per cent of the
basilar membrane. (It must be remembered that excitation is
here defined as masking. Its relation to physiological processes
in the ear has not been fully determined.)

Knowing the curve of Fig. 47, one can proceed to calculate
the loudness of a noise. First, the masking audiogram of the
noise is found by experiment. This audiogram is the curve
representing the threshold of all audible frequencies when
measured in the presence of the masking sound. Next, the
loudness-values F(M) are read from Fig. 47 and are plotted
against a scale representing the linear extent of the basilar
membrane. Finally, a graphical integration of the area under
the resulting curve gives a value proportional to the loudness
of the noise in millisones. Fletcher and Munson (2) have
developed special charts to facilitate these computations.
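The three steps just described (masking audiogram, reading of F(M), integration along the membrane) can be sketched numerically. The Python fragment below is an illustration only, under stated assumptions: the F(M) table in it is hypothetical and would have to be replaced by values read from Fig. 47, the trapezoidal rule stands in for the graphical integration, and the function names are introduced here and do not come from Fletcher and Munson.

```python
from bisect import bisect_left


def interpolate(x, xs, ys):
    """Piecewise-linear interpolation in a table, clamped at both ends."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_left(xs, x)
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def noise_loudness_millisones(position_pct, masking_db, table_m, table_f):
    """Integrate F(M) along the basilar membrane by the trapezoidal rule.

    position_pct     : positions along the membrane, in per cent of its length
    masking_db       : masking M (db) found at each of those positions
    table_m, table_f : tabulated F(M), in millisones per 1 per cent of length
    """
    f = [interpolate(m, table_m, table_f) for m in masking_db]
    total = 0.0
    for i in range(1, len(position_pct)):
        width = position_pct[i] - position_pct[i - 1]
        total += 0.5 * (f[i - 1] + f[i]) * width
    return total


# HYPOTHETICAL F(M) table, for illustration only (not read from Fig. 47).
TABLE_M = [0.0, 20.0, 40.0, 60.0]
TABLE_F = [0.0, 10.0, 100.0, 1000.0]

# A noise assumed to mask every region of the membrane by 40 db.
positions = [0.0, 50.0, 100.0]
masking = [40.0, 40.0, 40.0]
print(noise_loudness_millisones(positions, masking, TABLE_M, TABLE_F))
```

With a real masking audiogram the positions would be obtained from a frequency-to-position map of the membrane, which is not supplied here.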

THE LOUDNESS OF MULTI-COMPONENT TONES

The methods already outlined are adequate for dealing with
the loudness of single pure tones and of sounds having a relatively
continuous spectrum, such as thermal noises. Efforts to
calculate the loudness of tones composed of a limited number
of separate frequencies have met with only meager success.
The case is simple enough when components are added which
are sufficiently separated in frequency as not to stimulate overlapping
areas on the basilar membrane, for then the loudnesses
of the tones add in a simple arithmetic manner. But when
there is overlap, and one tone begins to mask the other, complications
arise, and the loudnesses are not simply additive.
The resulting effects must, in general, be determined by experiment.

A particularly interesting example is that in which all the
component tones are in harmonic relation. The sets of curves
in Figs. 48 and 49 represent the relation between loudness-level
and intensity level for tones having such a structure (Fletcher,
3). Each complex tone had ten harmonic components, all
equally intense. The numbers attached to each curve give the
fundamental frequency of vibration. In Fig. 48 the curve for
the 100-cycle pure tone is included for comparative purposes.
It is seen that changing the overtone structure from no overtones
to nine equally intense ones has increased the loudness-level
from 20 db to 60 db for the particular tone having a
fundamental frequency of 100 cycles and an intensity level of
51 db. As seen in Fig. 43, this corresponds to a change in
loudness from 0.1 to 6.0 sones, or an increase of sixtyfold.
Increases in loudness are produced on all tones by such a change
in overtone structure, but the increases are not so great for the
higher frequencies, or for the higher intensities. It will be seen
that, of the ten-component tones, those having a fundamental
frequency between 400 and 800 cycles are the loudest. These
quantitative results show why it is easy to increase the loudness
of a musical tone by increasing its overtone content, a practice
which is common in producing musical tones. Practically all
the loudness of the tones from the piano strings of low pitch
is due to the higher overtones.

... harmonic frequencies, each at an intensity level as shown by the
abscissa. The fundamental frequency, f, is indicated for each curve.
(Fletcher, 3.)

RELATION OF LOUDNESS TO THE THRESHOLD 
OF HEARING 

It can be seen from Fig. 44 that when a person's threshold
is normal, the equal loudness contours bear a definite empirical
relation to the threshold of hearing. Is this relation the same
for the ear whose threshold, at certain frequencies, is abnormal?
The answer to this question reveals interesting aspects of the
mechanism of the perception of loudness.

In general, if the threshold of hearing at a given frequency
is above normal, the perception of loudness at high intensities
may or may not be normal. Thus, in a series of tests (Steinberg
and Gardner) several people having some degree of unilateral
deafness, i.e., one impaired and one normal ear, were
required to make a tone heard with the deafened ear equal,
in loudness, to a tone heard with the normal ear. For some
people, the impaired ear heard less well than the normal ear
for all sound levels. For others, tones which were well above
the threshold of the deafened ear were heard about equally
well with either ear. In other words, such deafened ears tended
to hear loud sounds with almost normal loudness. People with
this type of deafness are the ones who seem to hear as well
as normal people when they are in noisy surroundings. The
reason is obvious. There is also an intermediate case where
hearing is improved with intensity, but does not become fully
normal.



Fig. 50. The intensities of two tones, one in a normal ear and one in a
deafened ear, which produce equal loudness. Three general types of deafness
are indicated by these curves. (After Steinberg and Gardner.)


Figure 50 illustrates these three types of ears. Each plot
represents the intensity level at which a 2000-cycle tone in the
normal ear sounded as loud as a 2000-cycle tone in the impaired
ear. If both ears were normal, the observed points would, in
each example, lie along the dotted line. Plot A shows the type
of deafness in which the increase in intensity needed to reach
threshold is the same as that necessary to yield equal loudness
at higher intensities. In plot B, however, although the threshold
of the impaired ear is elevated 60 db above the threshold of
the normal ear, equality of loudness is achieved when the intensity
level in both ears is 100 db. Hence, in this impaired ear,
loudness grows much faster as a function of intensity than it
does in normal ears. Plot C represents an intermediate type
of hearing loss: hearing improves at high intensities but does
not reach normal. The solid curve in each plot represents the
relation of loudness in the two ears as calculated by an empirical
method (see below).

We can account for the curve of plot A if we assume that
all frequencies entering the ear were attenuated approximately
30 db before they became effective for sensation. This type of
deafness is common to defects in the middle ear. The curve
of plot B, however, cannot be explained without considering the
nature of loudness. Referring to Fig. 43, let us assume that
there is a loss of 6 sones in the loudness of the tone. Such a loss
might result from a deficiency in the total number of neural
elements which normally contribute to give a tone loudness,
a case of 'nerve-deafness,' so called. Subtracting 6 sones from
the loudness function at all intensity levels, we obtain curve B
in Fig. 51. Curve A is the normal loudness function. Clearly,
at high intensities an ear having an assumed loss of 6 sones
tends to hear with practically normal loudness, although below
60 db the ear is deaf and hears no loudness at all. This picture
agrees precisely with that presented in plot B of Fig. 50. Hence,
it appears that the variable type of deafness, where hearing
becomes normal at high intensities, is associated with a condition
which results in a fixed reduction in loudness, as
contrasted with a fixed reduction in effective intensity for the
other type of deafness.
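The contrast between a fixed loss in effective intensity and a fixed loss in loudness can be made concrete with a small calculation. The sketch below is not the authors' procedure. It assumes, purely for illustration, a stand-in loudness function drawn as a straight line in logarithmic coordinates through the two values quoted earlier in the text (0.1 sone at 20 db and 6.0 sones at 60 db); the real function is the curve of Fig. 43. All function names are hypothetical.

```python
import math

# ASSUMPTION: stand-in for the loudness function of Fig. 43, a straight line
# in log coordinates through (20 db, 0.1 sone) and (60 db, 6.0 sones).
_SLOPE = math.log10(6.0 / 0.1) / (60.0 - 20.0)


def normal_loudness_sones(level_db):
    """Loudness heard by the normal ear at the given level (stand-in curve)."""
    return 0.1 * 10.0 ** (_SLOPE * (level_db - 20.0))


def level_for_loudness(sones):
    """Inverse of the stand-in curve: level at which a normal ear hears `sones`."""
    return 20.0 + math.log10(sones / 0.1) / _SLOPE


def matched_level_fixed_attenuation(level_db, loss_db):
    """Normal-ear level matching the impaired ear when every sound is
    attenuated by a fixed number of decibels (the case of plot A)."""
    return level_db - loss_db


def matched_level_fixed_sone_loss(level_db, loss_sones):
    """Normal-ear level matching the impaired ear when a fixed number of
    sones is subtracted (the case of plot B). Returns None below threshold."""
    remaining = normal_loudness_sones(level_db) - loss_sones
    if remaining <= 0.0:
        return None
    return level_for_loudness(remaining)


for level in (50.0, 70.0, 100.0):
    print(level,
          matched_level_fixed_attenuation(level, 30.0),
          matched_level_fixed_sone_loss(level, 6.0))
```

Under these assumptions the attenuated ear stays 30 db behind at every level, whereas the ear with a 6-sone loss hears nothing at 50 db and is within a fraction of a decibel of normal at 100 db.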

If, as seems probable, the variable type of deafness occurs
when there is a deficiency of neural elements, the hearing loss
caused by a masking sound would be expected to be of the
variable type. The nerve fibers which are activated by the
masking sound are ineffective in contributing to the loudness
of another tone heard in the presence of the masking sound.
In other words, masking is an effective means of decreasing the
available supply of neural elements and of producing
the equivalent of a variable deafness.

These conclusions were borne out under experimental
test. The observer adjusted a tone heard in his one unmasked
ear until it sounded as loud as a tone of the same
frequency heard in his other ear, which was being masked
by a thermal noise. For tones of all frequencies from 250
to 8000 cycles, the threshold in the masked ear was raised
by about 40 db, but at high intensities the loudness in the
masked ear was approximately equal to the loudness in
the unmasked ear. In other words, the results resembled
those shown in plot B of Fig. 50 and showed that the hearing
loss due to masking is of the variable type.

The Calculation of Hearing Loss. We have already noted
the relation between the masking effects and the loudness of a
sound for normal ears. If, from the masking audiogram of a
sound, its loudness to a normal ear can be calculated, the loudness
heard by an ear having a variable type of deafness should
be susceptible to the same methods of attack. For the normal
ear, the area under the masking audiogram (plotted against
the proper coordinates) is proportional to the loudness of the
masking sound. For the ear having variable deafness, the
area under the hearing loss audiogram is proportional to the



[Figure 51. Abscissa: db above reference level.]

Fig. 51. Showing how a defect which produces a fixed loss in loudness
(6 sones) affects the loudness-function. Curve A represents the normal
function, curve B a loss of 6 sones. The variable type of hearing loss
presumably causes a fixed loss of loudness as measured in sones.





Fig. 52. The loudness-patterns, F(M_T), produced on the basilar membrane
by a 2000-cycle tone at the various sensation levels indicated by the numbers
attached to the curves. Each curve relates the loudness per unit length (1 per
cent, or 0.3 mm) of the basilar membrane to the position of the excitation
on the membrane, and the area under each curve is proportional to the
loudness of the tone. By means of the function in Fig. 47, this loudness per
unit length was obtained from the masking audiogram of the tone. The masking
audiogram of a thermal noise was used to calculate its loudness-pattern,
F(M_N), and the hearing loss audiograms of two ears were converted into
loudness loss patterns, F(M_HL). In the presence of the thermal noise, or of
the hearing losses, the loudness of the 2000-cycle tone is given by the area
between the solid and the appropriate broken curve. (After Steinberg and
Gardner.)

Figure 52 illustrates this method. The solid curves represent
the loudness-patterns for a 2000-cycle tone at various sensation
levels. These curves are the masking audiograms of the
2000-cycle tone after they have been transformed into functions
expressing loudness per unit of length (0.3 mm) of the basilar
membrane. This transformation was made with the aid of
the function F(M) in Fig. 47 and a function like that in Fig.
35 (p. 96). The solid black points represent the loudness-pattern
obtained, in a similar way, from the masking audiogram
of a thermal noise, and the circles and crosses give the pattern
of loudness loss for the two ears of a deafened person. Now,
by simply measuring the areas under the parts of the normal
loudness-patterns which rise above the patterns of loudness-loss,
we obtain the loudness, as heard by the affected ears. Likewise,
the area enclosed between the normal loudness pattern and the
masking pattern gives the net loudness of the 2000-cycle tone
when heard in the presence of the masking noise.
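The area rule just stated (count only those parts of the normal loudness-pattern which rise above the pattern of loss or of masking) can be sketched as follows. This is an illustration under assumptions, not the procedure of Steinberg and Gardner: the sample patterns are invented, the trapezoidal rule replaces the measurement of areas on a chart, and the function name is hypothetical.

```python
def net_loudness(position_pct, tone_pattern, loss_pattern):
    """Area of the tone's loudness-pattern lying above the loss pattern.

    position_pct : positions along the basilar membrane (per cent of length)
    tone_pattern : loudness per unit length produced by the tone
    loss_pattern : loudness per unit length lost to deafness or masking

    Only the positive excess contributes. The trapezoidal rule applied to
    the clipped excess is slightly approximate near the points where the
    two patterns cross.
    """
    excess = [max(t - l, 0.0) for t, l in zip(tone_pattern, loss_pattern)]
    area = 0.0
    for i in range(1, len(position_pct)):
        width = position_pct[i] - position_pct[i - 1]
        area += 0.5 * (excess[i - 1] + excess[i]) * width
    return area


# INVENTED patterns, for illustration only (not taken from Fig. 52).
positions = [0.0, 10.0, 20.0, 30.0]
tone = [0.0, 100.0, 100.0, 0.0]
loss = [50.0, 50.0, 50.0, 50.0]

print(net_loudness(positions, tone, [0.0] * 4))  # loudness to a normal ear
print(net_loudness(positions, tone, loss))       # loudness to the impaired ear
```

Raising the loss pattern above the whole of the tone pattern gives zero net loudness, which corresponds to a tone below the threshold of the affected ear.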

It was by this method that the solid curves of Fig. 50 were
determined. The agreement between the calculated and the
observed loudness is rather satisfactory. In order to obtain the
curve in plot C of Fig. 50, it was, of course, necessary first to
subtract from the measured hearing loss the amount not due
to the variable type of deafness.

DIFFERENTIAL SENSITIVITY TO INTENSITY 

The smallest detectable change in the intensity of a tone
determines the intensitive differential sensitivity of the ear.
This sensitivity may be strictly defined as the reciprocal of the
just noticeable change, or DL (difference limen). Interest in
the measurement of DL's dates from the time of E. H. Weber,
who proposed the rule that the ratio of the DL to the intensity
at which it is determined (the Weber fraction) is constant for
any sense department. This ratio is also called the relative
difference limen, and we know now that Weber was mistaken
about its constancy. Early efforts to measure auditory sensitivity
suffered from technical difficulties which necessarily condemned
them to incompleteness. Knudsen (1) renewed the
early work and set about to explore the DL over a wide range
of intensity. His work was later superseded by a still more
thorough set of measurements by Riesz (1). It is these measurements
which we shall examine in detail.

Riesz presented his tones monaurally by means of a special 
moving-coil receiver designed to be especially free of distortion. 
The receiver was connected to the outputs of two oscillators in 
such a way that both oscillators activated the receiver simul- 
taneously and produced beats when the frequencies of the two 
impressed tones were close together. First the tone from one 
oscillator was presented at a definite sensation-level and then 
the intensity of the tone from the other oscillator was increased, 
from a point near zero, until the observer was just able to detect 
a beat. From the intensities of sound needed to obtain this beat,
the intensity at the maximum and at the minimum of the beat
could be calculated. The difference between the maximum
and the minimum was taken as defining the DL.

... rate of variation in the intensity of a tone. The curves are for the
sensation-levels of 25 and 50 db. (After Riesz, 1.)
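The text does not spell out how the intensities at the maximum and minimum of the beat were computed from the two oscillator outputs. A minimal sketch follows, assuming that the two components are sinusoids whose amplitudes alternately add and subtract, so that the extreme intensities are proportional to the squares of the sum and of the difference of the amplitudes. The choice of the minimum as the base intensity for the relative DL is also an assumption, and the function names are hypothetical.

```python
def beat_extremes(strong_db, weak_db):
    """Intensities at the maximum and minimum of the beat.

    strong_db, weak_db : levels of the two component tones in db relative
    to a common reference. Returned intensities are relative to the same
    reference intensity.
    """
    a_strong = 10.0 ** (strong_db / 20.0)  # amplitude of the strong tone
    a_weak = 10.0 ** (weak_db / 20.0)      # amplitude of the weak tone
    i_max = (a_strong + a_weak) ** 2       # components in phase
    i_min = (a_strong - a_weak) ** 2       # components in opposite phase
    return i_max, i_min


def relative_dl_from_beat(strong_db, weak_db):
    """Difference between beat maximum and minimum, relative to the minimum.

    ASSUMPTION: the minimum is used as the base intensity.
    """
    i_max, i_min = beat_extremes(strong_db, weak_db)
    return (i_max - i_min) / i_min


# A weak tone 20 db below the strong one.
print(beat_extremes(20.0, 0.0))
print(relative_dl_from_beat(20.0, 0.0))
```

In this example a component one-tenth the amplitude of the main tone swings the intensity between 81 and 121 units, a fluctuation of nearly half the minimum intensity.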

Although the type of transition from minimum to maxi- 
mum intensity by the method of beats has not usually been 
employed in measurements of differential sensitivity, it possesses 
the advantage that it produces a simple fluctuation in intensity 
which is not complicated by the possible presence of an undetermined
number of transients. If the transitions from the weaker
to the louder tone are made abruptly, some of the energy will
be scattered to frequencies higher and lower than the impressed
frequency. These transients may be audible, and provide the
observer with a false clue.

The size of the DL was found to be a function of the rate of
the fluctuations in intensity. A representative curve showing
the size of the relative DL as a function of the rate at which
the beats were presented is given in Fig. 53. It is characterized
by a broad minimum in the neighborhood of 3 cycles, and this
rate was adopted for the experimental determination of the DL
for intensity.

Fig. 54. The relation between the difference limen (ΔI) and sensation
level, at various frequencies (parameters). The relative difference limen is
shown on the scale at the right. The size of ΔI in decibels is equal to
10 log (1 + ΔI/I). (Data from Riesz, 1.)

Average curves giving the size of the relative DL as a function
of intensity (sensation-level), with frequency as the parameter,
are shown in Fig. 54. At a given frequency the relative
difference-limen approaches a constant value for intensities
above 50 db, but increases rapidly as the intensity is reduced
toward the auditory threshold. (See Table II for tabulated
data.)

Figure 55 shows the behavior of the relative DL as a function
of frequency for different values of the parameter intensity.
The relative DL is a minimum at a frequency of about 2500
cycles, although the minimum is less sharply pronounced at
high intensities than at low. The region of the greatest differential
sensitivity of the ear corresponds to the frequency range
of greatest absolute sensitivity.

Fig. 55. The relation between the DL and frequency, at various sensation
levels. (After Riesz, 1.)



TABLE II

Differential sensitivity to intensity at various frequencies and sensation
levels. Two entries appear at each frequency and sensation level. The
upper entry gives the value of ΔI/I (in terms of energy) and the lower value
(italicized) gives the value of ΔI in decibels. These data were obtained from
12 observers at the Bell Telephone Laboratories. (Printed by permission.)



It is of interest, at this point, to inquire into the nature of
the stimulus used by Riesz for obtaining his DL's, and to compare
it with the stimulus used to measure the DL's for frequency.
The stimulus for discrimination of frequency was obtained by
modulating the frequency of a tone, and, as outlined in the
preceding chapter, it consisted of a group of five or more steady
components which, by beating with each other, set up a migrating
disturbance on the basilar membrane. The spectrum of
Riesz's stimulus was, of course, much simpler, for it consisted
of only two steady components, a large and a small, spaced 3
cycles apart. How are we to regard the action of these components?

Each component sets up on the basilar membrane a disturbance
whose maximal amplitude may be represented schematically
by the solid curves of Fig. 56. The two components differ
in frequency, and, consequently, the two overlapping disturbances
on the membrane are alternately in and out of
phase with each other. When in phase, they reinforce to
produce a net disturbance corresponding to the dotted curve
whose maximum is at point A. When out of phase, point B
represents their maximum. Clearly, as the maximum
moves from A to B, not only is there an increase in the total
disturbance, but the maximum moves laterally along the
membrane by a slight amount. Apparently this lateral
displacement is too small to produce a noticeable difference
in pitch, although conceivably it could do so.

Incidentally, the lateral excursion of the disturbance could
be prevented by introducing another side band on the other side
of the large component. With the two side bands of the same
amplitude and the proper phase, the line from A to B could
be made exactly vertical, and we should have a case of pure
amplitude modulation (see Chapter 9). It is doubtful, however,
that the results of a pure amplitude modulation would be
different from those reported by Riesz.

FACTORS INFLUENCING DIFFERENTIAL 
SENSITIVITY 

Although we must accept the results of Riesz as the most
comprehensive and satisfactory measurement of differential
sensitivity available at present, it is clear, from the lack of
agreement among previous investigators, that the values obtained
depend to a large extent upon experimental conditions.
We shall consider some of these factors.

Fig. 56. Showing how the pattern of disturbance on the basilar membrane
changes when a faint tone beats with a loud tone. Two dotted curves show
the extreme positions of the disturbance due to the alternate reinforcement
and cancellation of the two solid curves.

Monaural versus Binaural Observation. Earlier results did
not disclose the fact, but the claim has recently been made
that in binaural listening it is possible to detect a change in
intensity 15 to 30 per cent smaller (on a decibel scale) than that
which is perceptible in monaural listening (Churcher, King,
and Davies; Upton and Holway). Whether or not so large a
difference would have appeared in Riesz's results had he used
binaural instead of monaural listening is problematic, but the
evidence supports the notion that auditory discrimination of
intensity is finer when both ears are involved.

Duration of the Tones. In both monaural and binaural
listening the DL is smaller when the duration of the tones is
greater. Upton and Holway showed that the decrease in the
size of the DL is related to the exposure time of a tone according
to an exponential function.

Transition between Tones. For optimal conditions the
transition between the tones to be compared should be abrupt,
instantaneous, and silent. A gradual transition, such as the
sinusoidal variation used by Riesz, is less easy to detect than an
abrupt transition, but, as already suggested, an abrupt transition
may involve the production of unwanted transients.

Any interval of silence between the tones decreases the sensitivity
of the ear to a change of intensity. The introduction of
an interval of half a second, under certain conditions, increases
the required intensity change by a third (Montgomery). Under
other conditions, going from a one-third-second to a three-second
interval increases the required change twofold.

The effect of transition time can be strikingly demonstrated
(Rawdon-Smith and Grindley) by presenting an observer with
a tone which is increased very slowly in intensity and then
decreased suddenly to the original value. When this process is
repeated, the observer reports hearing a tone which, by discrete
jumps, grows less and less loud. Objectively, of course, the
intensity is the same at the end of each jump.

Control of Presentation. Where two discrete tones are being
presented for comparison, it is important that the observer be
able to control the exact instant of transition from one tone
to the other. Thus, Montgomery found that the required
intensity-change was reduced by one-half when the observer
was allowed to operate the switch himself. This seemed to be
due to the fact that, under these conditions, the observer could
be prepared for the change at the exact instant it occurred.
A somewhat better judgment is also obtained when the observer
is permitted to listen to the tones as many times as he desires
before making his decision as to which is louder.

An illustration of the influence of some of the factors affect- 
ing differential sensitivity is given in Table III. 

All the values in this table were obtained from the same 
observer listening monaurally to a thermal noise at 40 db above 
threshold. Similar results were obtained using a 1000-cycle 
tone. 


TABLE III

Condition                                               Decibels   ΔI/I

1. Switch not controlled by subject, one comparison,
   half-second interval between tones                     0.8      0.20
2. Same, except no interval between tones                 0.6      0.15
3. Repeated comparisons, no interval between tones        0.4      0.096
4. Switch controlled by subject, repeated comparisons,
   no interval between tones                              0.2      0.047
5. Sinusoidal variation (continuous presentation)
   (cf. Riesz)                                            0.5      0.12
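The two columns of Table III are tied together by the relation given with Fig. 54, namely that the DL in decibels equals 10 log (1 + ΔI/I). The short Python check below applies that relation in both directions. The function names are introduced here for illustration, and the pairs listed are those legible for conditions 1 to 4.

```python
import math


def relative_dl_to_db(relative_dl):
    """DL in decibels from the relative DL: 10 log10(1 + dI/I)."""
    return 10.0 * math.log10(1.0 + relative_dl)


def db_to_relative_dl(dl_db):
    """Relative DL (dI/I) from the DL in decibels; inverse of the above."""
    return 10.0 ** (dl_db / 10.0) - 1.0


# (decibels, relative DL) as tabulated for conditions 1 to 4.
pairs = [(0.8, 0.20), (0.6, 0.15), (0.4, 0.096), (0.2, 0.047)]
for tabulated_db, relative in pairs:
    print(tabulated_db, relative, round(relative_dl_to_db(relative), 2))

# Relative DL corresponding to the 0.5 db of condition 5.
print(round(db_to_relative_dl(0.5), 2))
```

Each tabulated pair agrees with the relation to the precision printed, which is a useful check when a digit in the table is in doubt.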


NATURE OF THE DIFFERENCE LIMEN 

Under conditions designed to measure the DL, the observer’s 
response is always variable. This phenomenon of fluctuation, 
familiar to everyone who has made psychophysical measure- 
ments, necessitates the adoption of some statistical criterion for 
determining the value of the DL. The value usually selected 
is the difference which the observer is able to detect 50 per cent 
of the time. Smaller values would be detected less and larger 
values more than 50 per cent of the time. 

Fig. 57. Showing how the sensitivity of an observer varies with time. The
quantity S is a measure of the instantaneous sensitivity of the ear, and is
determined by the value of the increment which, at any instant, would be just
detectable. The area under a curve, between any two limits, is proportional
to the probability that S will be between those limits at any instant. Each
curve is for a different sensation level, as indicated. (Montgomery.)

In order to illustrate how the instantaneous sensitivity of the
ear varies from time to time, Montgomery has defined a quantity
S which is variable with time in such a way that at any
instant the ear is able to perceive any increment of intensity
greater than S, but is not able to perceive an increment less
than S. In his experimental work, a definite value of the increment
was chosen, and the proportion of the time that the
observer could detect this increment was taken as defining the
proportion of the time that S had a value less than the increment.
With this interpretation we may regard the curves
of Fig. 57 as the distribution curves of S. Then, the portion
of the area under one of these curves, between any two limits,
is the probability that S will be between those limits at any
instant. Actually, these curves were derived from the slopes
of the psychometric functions obtained from the experiment.
(See Guilford for a discussion of these functions.) The curves
of Fig. 57 enable one to appreciate readily the manner in which
the sensitivity of the ear varies with time.
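The statement that the distribution curves were derived from the slopes of the psychometric functions can be illustrated with a finite-difference sketch. It treats the proportion of detections at each increment as the cumulative distribution of S and takes the slope between neighboring points as the density. The data are invented for illustration, and the function name is hypothetical; this is not Montgomery's own computation.

```python
def sensitivity_density(increments, proportion_detected):
    """Slope of the psychometric function between successive increments.

    increments          : increment sizes tested, in ascending order
    proportion_detected : proportion of trials on which each was detected,
                          read as the proportion of time that S was smaller
                          than that increment

    Returns a list of (midpoint, density) pairs. Density times interval
    width gives the probability that S lies within the interval.
    """
    density = []
    for i in range(1, len(increments)):
        width = increments[i] - increments[i - 1]
        midpoint = 0.5 * (increments[i] + increments[i - 1])
        slope = (proportion_detected[i] - proportion_detected[i - 1]) / width
        density.append((midpoint, slope))
    return density


# INVENTED psychometric data, for illustration only.
increments = [0.0, 0.5, 1.0]
detected = [0.0, 0.25, 1.0]
print(sensitivity_density(increments, detected))
```

Because the detection proportions rise from 0 to 1, the densities multiplied by their interval widths sum to unity, as the areas under the curves of a probability distribution must.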

The curves of Fig. 57 show a distribution of sensitivity such
as is usually found in a psychophysical experiment. These
curves exhibit a resemblance to the 'normal curve' for the
distribution of chance errors, and it has usually been assumed that
they are, in fact, the result of chance variation on the part of
the observer. However, in order to obtain 'normal curves'
under these conditions, we must assume, not only that sensitivity
varies in random fashion with time, but also that some
of this variation occurs between or during the presentation of
the two tones. At least such an assumption is required if we are
to conceive of differential sensitivity as being quantal in nature.
Let us examine the quantal notion more closely.

The simplest assumption to be made about discrimination is
that the organism can detect an increment to a stimulus when,
and only when, the increment is large enough to excite one
additional 'neural unit' (nerve fiber?). Then, the size of the
necessary increment will depend upon how far the previous
stimulus has exceeded the threshold of the last excited unit. As
depicted schematically in Fig. 58, a noticeable difference will
occur when an amount Δ is added to the stimulus, because then
the next 'neural unit' NU will be excited.

Now, suppose the over-all sensitivity of the organism is in
random fluctuation. Or, what amounts to the same thing,
suppose the height of the stimulus column in Fig. 58 varies in
chance fashion with time. Then, the size of the necessary
increment Δ will vary from zero to the size of the interval NU.
But, since one size of the necessary increment is as probable as
any other, the probability that any given increment will be
noticed is equal to the ratio of Δ to NU. This simple relation
will hold provided no change in sensitivity occurs between or
during the presentation of the two stimuli to be compared. If
these conditions are fulfilled, the psychometric function should
become a straight line, and the curves for the distribution of
sensitivity, as shown in Fig. 57, should become rectangular
instead of bell shaped. Can such results be realized in a concrete
experiment?
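The quantal argument lends itself to a brief simulation. The sketch below assumes, as the text does, that the surplus of the stimulus over the threshold of the last excited unit is equally likely to lie anywhere within one unit, and that the increment is noticed only if it carries the stimulus past the required number of further thresholds. The parameter `units_required` (1 for the simple case, 2 for an observer who needs two additional units) and the function names are introduced here for illustration.

```python
import random


def detection_probability(increment, unit_size, units_required=1):
    """Predicted proportion of detections under the quantal assumption.

    With one unit required this is the ratio of the increment to the unit
    size, limited to the range 0 to 1: a straight-line psychometric function.
    """
    p = (increment - (units_required - 1) * unit_size) / unit_size
    return min(1.0, max(0.0, p))


def simulate(increment, unit_size, units_required=1, trials=20000, seed=0):
    """Estimate the same proportion by drawing random surpluses."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        surplus = rng.uniform(0.0, unit_size)  # excess over last threshold
        needed = units_required * unit_size - surplus
        if increment >= needed:
            hits += 1
    return hits / trials


for inc in (0.1, 0.3, 0.5, 0.9):
    print(inc, detection_probability(inc, 1.0), simulate(inc, 1.0))
```

With `units_required=2` the predicted function stays at zero until the increment equals one unit and then rises linearly to unity at two units.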

Békésy (6) presented a tone lasting only 0.3 sec, followed
immediately by a second tone of the same duration and of variable
intensity. The observer reported whether or not he noticed a
difference in loudness between the two tones. The tones were
produced in earphones which were incorporated in a bridge
circuit. The circuit was tuned in such a way as to suppress the
noise arising from the operation of switching from one tone to
the other. Under these conditions it was possible to obtain the
results shown in Fig. 59. Plot A represents the distribution of
judgments obtained with observers less practiced or less
sensitive. Apparently two additional neural units are needed to
produce a noticeable difference. Even with sensitive observers,
such a distribution can be obtained when the tones are very weak,
or when some disturbing factor is introduced into the experiment.
Plot B is of greater theoretical interest. It is the sort
which Békésy obtained from well practiced observers, under the
most ideal conditions, and it agrees precisely with what we
should expect on the basis of our assumption regarding the
quantal nature of discrimination. Hence, if Békésy's results can
be substantiated in future experiments, we shall have good
reason to accept the notion that the mechanism of discrimination
is fundamentally quantal in nature, although this fact is
normally obscured by a random fluctuation in the over-all
sensitivity of the organism.

Fig. 58. Schema illustrating the nature of differential sensitivity
according to the assumption that it is a quantal phenomenon. Δ is the
increment which must be added to the stimulus in order to excite an
additional neural unit. The neural unit may or may not correspond to an
anatomical unit, such as a single nerve cell. (After Békésy, 6.)

In both plots the distributions fail to be symmetrical about
the point of objective equality of the two tones (zero on the
abscissa). Békésy explains this as being due to the fact that the
first tone is of such short duration that it does not build up to
the full loudness it would have if continued longer. Consequently,
when the second tone follows at the same intensity,
the observer experiences a growth in loudness, due merely to
the factor of duration. This factor (or time-error) operates to
shift the judgments in such a way that the tone judged equal
100 per cent of the time is a tone slightly less intense than the
standard tone. (See Boring, 3, for discussion of the time-error.)

Fig. 59. Showing the distribution of judgments regarding the sameness or
difference of two tones which differ by an amount ΔI. (After Békésy, 6.)


THE INTEGRATION OF DIFFERENCE LIMENS 

From the data contained in Fig. 54 it is possible to find the
number of discriminable steps in loudness when proceeding
from one intensity to another. A satisfactory method of
determining this number is to plot the function I/ΔI (reciprocal
of the relative DL) against log I (or the decibel scale) and
measure the area under the resulting curve. This area is
proportional to the number of DL's between the limits bounding
the area.
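The area method can be sketched numerically. If each DL is treated as a small step, the number of steps in a short interval is dI/ΔI, which equals (I/ΔI) d(ln I); the area under I/ΔI plotted against the natural logarithm of intensity therefore approximates the count directly, and against the decibel scale it does so after multiplication by (ln 10)/10. The fragment below applies the trapezoidal rule to this area. The sample values are invented, not taken from Riesz, and the function name is hypothetical.

```python
import math


def count_dls(levels_db, relative_dl):
    """Approximate number of DL's between the first and last level.

    levels_db   : sensation levels (db), ascending
    relative_dl : relative DL (dI/I) measured at each of those levels
    """
    db_to_ln = math.log(10.0) / 10.0          # one db expressed in ln(I)
    reciprocal = [1.0 / w for w in relative_dl]
    area = 0.0
    for i in range(1, len(levels_db)):
        width = (levels_db[i] - levels_db[i - 1]) * db_to_ln
        area += 0.5 * (reciprocal[i - 1] + reciprocal[i]) * width
    return area


# INVENTED example: a relative DL of 0.1 at every level from 0 to 100 db.
levels = [0.0, 50.0, 100.0]
relative = [0.1, 0.1, 0.1]
print(count_dls(levels, relative))
```

The approximation is close when the relative DL is small; near threshold, where the DL is a large fraction of the intensity, counting the steps one at a time would give a somewhat different figure.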

Figure 60 shows the result of an integration of the DL's for
intensity (Riesz, 2). These curves show that the total number
of discriminable steps between two sensation levels is different
at different frequencies. Integrations taken all the way from
the threshold of audition to the threshold of feeling, at various
frequencies, show that the maximal number of steps occurs
between 1000 and 2000 cycles (Riesz, 1).


Fig. 60. Showing the relation between the number of DL's (ordinate) and
the number of decibels (abscissa) that a tone is above threshold. These
curves were obtained by integration of a function relating ΔI to intensity.
(Riesz, 2.)


The Subjective Size of Difference Limens for Intensity.
Now that we have a scale for the measurement of the subjective
magnitude, loudness (Fig. 43), it becomes possible to
answer the question which has agitated psychologists since the
days of Fechner regarding the subjective size of an intensitive
DL. Fechner assumed that all DL's are subjectively equal and
proceeded forthwith to integrate them in an effort to determine
the magnitude of a sensation. However, it has been found that
summating the same number of DL's for two tones of different
frequency does not yield equal loudnesses (Newman). Work
on equal sense-distances also failed to confirm Fechner's assumption,
but suggested that the DL's at high intensities are
subjectively larger than those at low intensities, although it could
not be said how much larger (Titchener, 2). It should be noted
that we are not here concerned with the constancy of the Weber
fraction ΔI/I, which was another of Fechner's assumptions, but
only with the problem of subjective magnitude, i.e., the ability
of an added just-noticeable difference to contribute always the
same increment to the total subjective effect of the stimulus.

If Fechner's assumption regarding the subjective equality of
DL's were correct, the summated DL's (Fig. 60) would yield
a function proportional to the loudness-function of Fig. 43.
These two functions are not proportional, but from their forms
we can proceed to determine their relation, and thereby to measure
the one in terms of the other. The relation turns out to
be almost a power function, as is shown by the fact that
straight lines are obtained in logarithmic coordinates when
the one function is plotted against the other (see Fig. 61, and
Stevens, 7). If the lines here are taken as defining the relation
between loudness and the number of DL's above threshold,
the equation

    L = K N^{2.2}

can be written. Here L is loudness and N is the number of
DL's above threshold. The constant K can be determined from
the intercepts of the lines with the loudness-axis. The exponent
is the same for all frequencies, because the slopes of the lines are
the same. Of course, since the data for the higher frequencies
could best be fitted by curves slightly concave downward, the
exponent is not strictly constant, but the data probably do not
warrant more precision in the determination of the exponent.
On Fechner's assumption this exponent would be unity.

Fig. 61. The relation of loudness (in sones) to number of DL’s. The
points for 7000 cycles have been shifted 0.5 logarithmic unit upward on the
ordinate scale, in order to facilitate plotting. L represents loudness in sones,
N the number of DL’s above threshold, and K is a constant determined by
the intercepts of the lines on the axis of log loudness at zero value of the
ordinate. The values for K are .0070, .00112, .00028, .00070 for 200, 1000,
4000, and 7000 cycles respectively (Stevens, 7).

Now, to measure the size, in sones, of the first DL above
threshold, we may set N = 1, and then the size becomes equal
to the value of K. The value of K varies with frequency, and
is smallest for frequencies near 3000 cycles. Not only does the
subjective magnitude of a DL depend upon the frequency of a
tone, but it varies also as a function of the number of the DL
above threshold. This relation is shown in Fig. 62. The equation
in this figure relates the size of a difference limen (DL) to its
number N, and was obtained by differentiating the previous
equation. The vast disparity between the subjective magnitudes
of different DL’s is clearly apparent. Hence, their integration
for the purpose of obtaining a numerical scale of loudnesses
is not permissible.
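
The power function relating loudness to the number of DL’s, and the
derivative that gives the size of a single DL, can be tried numerically.
The sketch below is illustrative only. The values of K are those listed
in the caption of Fig. 61; the exponent 2.2 is an assumption about the
slope of the lines in that figure; the function names are invented for
this example.

```python
import math

# Intercept constants K from the caption of Fig. 61 (cycles -> K).
K_BY_FREQUENCY = {200: 0.0070, 1000: 0.00112, 4000: 0.00028, 7000: 0.00070}

# ASSUMPTION: exponent of the power function, taken as the common slope
# of the lines in Fig. 61. Fechner's assumption corresponds to 1.0.
EXPONENT = 2.2


def loudness(n_dl, k, p=EXPONENT):
    """Loudness in sones at n_dl DL's above threshold: L = K * N**p."""
    return k * n_dl ** p


def dl_size(n_dl, k, p=EXPONENT):
    """Size of a DL near position n_dl, from the derivative
    dL/dN = p * K * N**(p - 1)."""
    return p * k * n_dl ** (p - 1)


def loglog_slope(n1, n2, k, p=EXPONENT):
    """Slope of log L against log N between two points.
    For a power function this equals the exponent p."""
    rise = math.log10(loudness(n2, k, p)) - math.log10(loudness(n1, k, p))
    run = math.log10(n2) - math.log10(n1)
    return rise / run


if __name__ == "__main__":
    for freq, k in K_BY_FREQUENCY.items():
        # Loudness at N = 1 equals K; DL size grows with N when p > 1.
        print(freq, loudness(1, k), dl_size(10, k), dl_size(100, k))
```

With any exponent greater than unity the later DL’s come out larger than
the earlier ones; with an exponent of exactly 1 every DL has the size K,
which is Fechner’s case.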

Fig. 62. The subjective magnitude of the DL’s as a function of their
number above threshold. K and N are the same as in Fig. 61. Frequency is
the parameter (Stevens, 7).

Before leaving the topic of the subjective size of the intensitive
DL, we should note the important implication for psychophysiology
of the fact that the DL’s are not equal. The
hypothesis that a just noticeable difference occurs in a sensation
when an additional ‘neural element’ is brought into activity is
attractive for its simplicity. Equally attractive is the simple
notion that loudness is proportional to the number of active
‘neural elements’ (see Chapter 16). The meaning of Figs. 61
and 62, however, is that these two notions are incompatible. If
loudness is proportional to the number of active elements, a DL
cannot be founded upon the addition of a single active element.
Of course, it may be that neither of these relations between
‘neural elements’ and sensation is true (we do not know), but
the ultimate solution of the psychophysiological problem of
loudness will have to account for the functions depicted in
Fig. 61 (cf. Stevens and Davis).

THE TOTAL NUMBER OF DISTINGUISHABLE
TONES

One more interesting question relating to difference limens
deserves our attention. Once we know the values, throughout
the audible range, of the DL’s for frequency and for intensity,
we can proceed to calculate the total number of pure tones that
the ear can distinguish from one another. When we hold
intensity at a medium value and vary frequency alone, there are
about 1500 just noticeable steps from the lowest to the highest
audible frequency. When we vary only the intensity of a tone
in the middle range, we discover about 325 just detectable steps
in loudness. The total number of tones that can be distinguished
in any respect, either in pitch or in loudness, is the
product of the number of DL’s for frequency and the number
for intensity. Unfortunately, however, the determination of
this product involves more than simply the multiplication of
two numbers, for the auditory area (cf. Fig. 19) is not square,
nor is the density of DL’s the same in different regions.
Therefore, in order to determine the total number of DL’s in the
auditory area, we find it most convenient to divide the area into
small units and find the number for each unit separately.

This procedure is illustrated in Fig. 63. The auditory area
contained between the threshold of audibility and the threshold
of feeling was divided into units, or cells, measuring half an
octave in width by 10 db in height. Then the height in DL’s of
each cell was determined from the data of Riesz (1) and the
width, also in DL’s, from the data of Shower and Biddulph.
These two values were multiplied to give the number of DL’s
contained in each cell. Then the total number in all the cells
was found by addition. These computations reveal that there
are about 340,000 distinguishable tones in the entire audible
range. Curiously, when the total number of distinguishable
colors is deduced from the known number of DL’s for hue,
brightness, and saturation, the result is of the same order of
magnitude.

Fig. 63. The number of distinguishable tones in the auditory area. The
total area is divided into cells whose dimensions in DL’s are given by the
numbers in the cell. The first number gives the height of the cell in DL’s for
intensity, the second gives the width in DL’s for frequency. The product of
these two numbers is written directly below.
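
The cell-by-cell computation of Fig. 63 amounts to a sum of products,
which may be sketched as follows. The cell values used here are
hypothetical and were not read from the figure; the function name is
likewise invented for the example.

```python
def count_distinguishable_tones(cells):
    """Total number of distinguishable tones.

    `cells` is a sequence of (height, width) pairs, where height is the
    number of intensity DL's and width the number of frequency DL's
    contained in one cell of the auditory area.
    """
    return sum(height * width for height, width in cells)


# HYPOTHETICAL cells, for illustration only (not the values of Fig. 63).
example_cells = [(12, 40), (20, 55), (25, 60), (8, 30)]

# 12*40 + 20*55 + 25*60 + 8*30 = 480 + 1100 + 1500 + 240
total = count_distinguishable_tones(example_cells)
print(total)
```

Summing cell by cell allows both the irregular outline of the auditory
area and the uneven density of DL’s to enter the result, which a single
multiplication of two over-all counts could not do.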

RELATION OF LOUDNESS TO DURATION 
When a tone is turned on, its loudness passes through a
period of growth before reaching its final value. It may, in
some instances, reach a maximal loudness and then decline
slightly to a steady-state value (Békésy, 3). In general, tones
lasting less than half a second appear less loud than tones of the
same amplitude whose duration is greater (cf. Békésy, Id).
Lifshitz has proposed an integral law relating loudness and
duration. For pure tones of short duration, the law reduces to
It = K, where I is the loudness level of the sound (in decibels),
t is time in seconds, and K is a constant. This equation states
that, in order to maintain constant loudness, the loudness level
must be increased by the same proportion that the time is
decreased. The author of the law obtained experimental data
for times ranging from 0.012 to 0.69 sec and for loudness levels
from 34 to 84 db. The data show that within these limits the
hyperbolic relation holds. Furthermore, the relation is
essentially the same at all frequencies from 50 to 4000 cycles.

These results do not confirm those reported earlier by Békésy
(4). He found, for an 800-cycle tone at durations less than 0.1
sec, that constant loudness was obtained when

I = k log t + C

Here k and C are constants, I is loudness-level (in decibels), and
t is time. The constant k is negative, so that, as t decreases, I
must increase in order to keep the loudness unchanged.

Additional experimental evidence is probably needed to
decide between these two functions relating loudness and
duration. It should be said, however, that the data of Lifshitz are
rather more extensive than those of Békésy.
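
The difference between the two proposed functions is easily seen when
both are evaluated at a few short durations. The sketch below is only
illustrative: the constants K, k, and C are hypothetical values chosen
to give loudness levels of a plausible size, the logarithm is assumed to
be to the base 10, and the function names are invented.

```python
import math


def lifshitz_level(t, K):
    """Loudness level I (db) required at duration t (sec) for constant
    loudness under the hyperbolic relation I * t = K."""
    return K / t


def bekesy_level(t, k, C):
    """Loudness level I (db) required at duration t (sec) for constant
    loudness under I = k log t + C, with k negative.
    ASSUMPTION: 'log' is taken as the common (base 10) logarithm."""
    return k * math.log10(t) + C


if __name__ == "__main__":
    # HYPOTHETICAL constants, not taken from either experimenter.
    K = 1.0              # db x sec
    k, C = -10.0, 40.0   # db per tenfold change in t, and db
    for t in (0.0125, 0.025, 0.05, 0.1):
        print(t, lifshitz_level(t, K), bekesy_level(t, k, C))
```

Under the hyperbolic relation, halving the duration doubles the required
level; under the logarithmic relation, each tenfold shortening adds the
same fixed number of decibels.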

THE LOUDNESS OF SHORT IMPULSES 

When the duration of a tone is made small enough, it
reaches a point where we cease to call it a tone and place it in
the class of impulses or short noises. Short noises are a very
common sort of sound and are made by an atmospheric
disturbance of very short duration. In spite of their short duration
they manifest great variety in their subjective aspects. Witness
the qualitative difference between the sharp crack due to
a spark and the dull boom of distant gunfire. These differences
are presumably due to differences in the forms of the sound
waves. The loudness of a sound consisting of a single wave is
dependent not only upon the amplitude of the wave, as we
should expect, but also upon the form of the wave. The
investigation of the response of the ear to short impulses should
ultimately be of great importance for an understanding of
auditory function.

Short impulses of sound are most readily produced and
studied by passing brief currents through a transducer. The
charge or discharge of a condenser through the transducer, with
or without an inductance in the circuit, produces impulses whose
form, duration, and amplitude can be controlled. Thus, the
discharge of a condenser through a resistance produces a current
which rises almost instantaneously to a maximum and then
declines according to an exponential curve (see Fig. 64). The
current declines to 1/2.718 of its value in a time equal to the
product CR, where C is the capacity and R the resistance in the
circuit. This product is known as the time constant, T, of the
circuit.
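
The meaning of the time constant can be checked with a few lines of
arithmetic. The following is a minimal sketch assuming an ideal
condenser and resistance; the component values are hypothetical, chosen
only so that T comes to 1 msec, and the function name is invented.

```python
import math


def discharge_current(t, i0, R, C):
    """Current at time t (sec) after the start of the discharge:
    i(t) = i0 * exp(-t / (R * C)), where i0 is the initial current."""
    return i0 * math.exp(-t / (R * C))


# HYPOTHETICAL values: 1000 ohms and 1 microfarad.
R = 1000.0           # resistance in ohms
C = 1e-6             # capacity in farads
T = R * C            # time constant in seconds (1 msec here)

# After one time constant the current has fallen to 1/2.718 of its
# initial value, whatever that initial value may be.
ratio = discharge_current(T, 1.0, R, C)
print(T, ratio, 1 / math.e)
```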

A comparison of the loudness of condenser discharges (time
constant = 1 msec) with the loudness of a 1000-cycle tone at
various amplitudes shows that the loudness of impulses and of
tones follows the same law of growth as a function of amplitude,
except for small amplitudes (Steudel). Below a loudness-level
of 40 db the loudness of the impulses decreased more rapidly
than that of the tone, for a given reduction in amplitude. It is
especially necessary to employ experienced observers in work of
this sort, because of the difficulty of comparing loudnesses in the
face of large qualitative differences in the sounds.

Figure 64 shows the effect of duration, as measured by the
time constant, T, on the loudness of impulses of fixed amplitude.
As T increases up to 1 msec the loudness grows; for longer
durations the loudness remains constant at a value dependent solely
upon the amplitude.

Fig. 64. Showing how the loudness of an impulse varies with its duration.
The impulse has a form as shown by the pressure-curve, and a time-constant
T (in milliseconds), as shown by the abscissa (After Steudel).

When a condenser, in parallel with a transducer, is charged
through a resistance, the current in the transducer rises
according to an exponential curve (concave downward). The
loudness of a sound impulse produced by this current depends upon
the time constant T in a manner like that shown in Fig. 65.
The longer T, i.e., the slower the rise in current, the weaker the
sound. If the impulse does not reach two-thirds of its maximal
value inside a tenth of a second, no sound is heard.

Another case deserves attention. When the condenser is in
a circuit containing resistance and inductance in the proportions
proper for making the circuit critically damped, its discharge
produces an impulse of the form shown in Fig. 66. Steudel
measured the loudness-level of this type of impulse as a function
of duration. The duration was measured in terms of the
time constant $\sqrt{LC}$, where L is inductance and C is capacitance.
The maximal amplitude of the impulse occurs when $t = \sqrt{LC}$.
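
That the maximum falls at $t = \sqrt{LC}$ follows from the standard
form of the critically damped discharge. The derivation below is a
sketch resting on two assumptions not stated in the text: that the
circuit is an ideal series circuit with an initial condenser voltage
$V_0$ (a symbol introduced here), and that the sound pressure follows
the current.

```latex
% Critical damping in a series R-L-C circuit requires R = 2\sqrt{L/C},
% so that the decay rate R/(2L) equals 1/\sqrt{LC}. Writing T = \sqrt{LC}:
i(t) = \frac{V_0}{L}\, t \, e^{-t/T}

% Setting the derivative equal to zero locates the maximum:
\frac{di}{dt} = \frac{V_0}{L}\, e^{-t/T}\left(1 - \frac{t}{T}\right) = 0
\quad\Longrightarrow\quad t = T = \sqrt{LC}
```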






The results for a fixed amplitude are shown in Fig. 66. With
this type of impulse there is an optimal duration for its effec-



Fig. 66. Showing how the loudness of an impulse depends upon its time
constant ($\sqrt{LC}$), whose value is shown on the abscissa (After Steudel).

rapidly enough, the experienced loudness is augmented. As
the frequency of the impulses is increased from 1 to 50 per
second, the loudness-level increases by about 10 db, but further
increase has practically no effect. This result checks with the
finding that the loudness of a single impulse of given pressure
amplitude is about 10 db less than the loudness of a pure tone
of the same maximal amplitude.

From these experiments of Steudel’s, we may conclude
that the ear reacts only to a change of atmospheric pressure.
This change must take place within a brief interval of time,
but, if the change and its restoration take place too quickly,
the ear detects nothing. (Thus a 30,000-cycle tone is
inaudible.) Steudel observed that it is the changes occurring
within the interval of 0.3 msec which determine the loudness
of the impulse. In fact, the loudness is related to the area
under the pressure curve contained within this short interval
of time, provided the interval contains the steepest part of the
pressure curve. Starting with these notions, Steudel was able
to develop an empirical formula for the loudness level of
impulses which agrees fairly well with observation.

In particular the formula shows that impulses of many
different forms may have equal loudnesses. Figure 67 presents
three of these forms whose loudnesses are equal both by actual
measurement and by calculation. The shaded areas are the
parts of the impulses significant for loudness, according to
Steudel’s conception.

Fig. 67. Various forms of impulses having the same loudness. The shaded
areas represent the part of the impulse which is effective in producing
loudness. (These impulses would not sound the same in pitch.) (After
Steudel.)

Although Steudel’s empirical formula is adequate for certain
situations, there is a more fundamental type of analysis
which we should consider. This method of analysis is based
on the fact that any wave, such as those in Figs. 64 to 67, can be
analyzed into its Fourier components, just as a complex tone
may be resolved into a group of pure tones. The difference is
that, whereas only certain harmonically related tones will be
found present in a complex tone, an impulse of sound can be
analyzed into a continuous spectrum of acoustic energy. Thus
a sound made by the wave pictured in Fig. 65 contains all
frequencies within the audible range, but the relative amplitude
of these components is smaller at the higher frequencies (cf.
Fig. 39, p. 104).

Now, by determining first the spectrum of the sound
impulse, and second the sensitivity of the ear to the various
frequencies of the spectrum, we might be able to compute the
loudness of the impulse. Burck, Kotowski, and Lichte (1)
did so, and obtained theoretical curves which agree remarkably
well with their own (Fig. 65) and with Steudel’s measurements
(Figs. 64 and 66).
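
The first of these two steps, the determination of the spectrum, can be
illustrated for the simplest impulse considered above. The sketch below
assumes an idealized pressure wave that jumps to unity and then decays
exponentially with time constant T, as in the condenser discharge; it
does not reproduce the computation of the authors cited, and it omits
the second step (weighting by the sensitivity of the ear). The function
names are invented for the example.

```python
import math


def spectrum_magnitude(f, T):
    """Magnitude of the continuous Fourier spectrum at frequency f
    (cycles per sec) of the impulse p(t) = exp(-t/T) for t >= 0:
    |P(f)| = T / sqrt(1 + (2*pi*f*T)**2)."""
    return T / math.sqrt(1.0 + (2.0 * math.pi * f * T) ** 2)


def spectrum_magnitude_numeric(f, T, dt=1e-6, span=20.0):
    """The same quantity by direct numerical integration (midpoint rule)
    from t = 0 to t = span * T, as a check on the closed form."""
    re = 0.0
    im = 0.0
    steps = int(span * T / dt)
    for n in range(steps):
        t = (n + 0.5) * dt
        p = math.exp(-t / T)
        re += p * math.cos(2.0 * math.pi * f * t) * dt
        im -= p * math.sin(2.0 * math.pi * f * t) * dt
    return math.hypot(re, im)


if __name__ == "__main__":
    T = 1e-3  # time constant of 1 msec
    for f in (100.0, 1000.0, 10000.0):
        print(f, spectrum_magnitude(f, T))
```

Every frequency is present in some degree, and the magnitude falls
steadily as the frequency rises, which is the property of the continuous
spectrum noted in the text.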

The significance of this type of analysis, provided it can be
shown to apply to the phenomena we have just considered, is
evident indeed. It indicates that, even in short aperiodic stimuli
(clicks, pops, cracks, etc.), the ear behaves essentially as an
analyzer and evaluates the components of the sound according
to the frequency and intensity of the components (cf. Fig. 117,
p. 283).



CHAPTER 5 


THE OTHER ATTRIBUTES OF TONES 

It has been the traditional view of psychology that the attributes
of sensation show a one-to-one correspondence to the dimensions
of the stimulus. Some such view is also implicit in the naive
epistemology of the physicist. He tends to think of pitch as if
it were the perception of the frequency of a tone, but we have
seen that holders of that view run into difficulties. The pitch
of a pure tone can be altered without changing its frequency;
likewise, the loudness of a tone may be varied without changing
its intensity. Pitch is a function of the two physical variables,
frequency and intensity; loudness is a different function of the
same two variables. Both pitch and loudness are fundamentally
to be conceived as reactions on the part of organisms to
sound waves. These are systematic reactions, to be sure, and
can be ordered on scales and evaluated, but they are,
nevertheless, products of the interaction of an atmospheric disturbance
with a living system.

The problem, then, suggests itself as to whether other
systematic reactions to pure tones can be obtained from normal
observers. Are there additional tonal attributes? Certainly
there can be no theoretical, or a priori, presumption against them,
for the number of different functions of the two variables,
frequency and intensity, that can be conceived is unlimited (cf.
discussion by Boring, 4). The only limitation on the number
of possible attributes of sensation is a practical one.
Differentiation of the sort that can lead to differential neural reaction
is not unlimited in the finite organism. We may properly
expect to find as separate attributes of tones only those functions
of frequency and intensity whose differentiation falls within
the resolving power of the organism.





VOLUME 

Several writers, during the last century, have disclosed the
fact that tones are characterized by an apparent “largeness” or
“extensiveness” (see Rich, 1). The low tones of an organ
appear to be “bigger” than the high chirp of a cricket, even
when the loudnesses of the two are equal. This subjective
aspect of a tone is known as volume. (The radio-engineer speaks
of the ‘volume’ of a sound, or of the ‘volume control’ of a radio
set, but he means by volume what we should properly call
intensity.)

Rich (1) made the first attempt to bring the problem of
volume into the experimental laboratory, by measuring just
noticeable differences in volume as a function of frequency.
Halverson (3) later measured these differences as a function of
intensity. The argument was that, if the DL’s for volume are
different from those for pitch and loudness, it follows that
volume is a separate attribute of tones, since it obeys different
laws. Later experiments (Zoll) failed, however, to confirm the
independent status of volume when the same technique was
used. The results of different experimenters simply did not
agree. Nevertheless, throughout all the experiments evidence
accumulated from the reports of the observers that,
phenomenally, volume is a unique and distinct attribute of tones.

The validity of volume as an attribute was established
experimentally (Stevens, 2) by the same method used to determine
the isophonic contours of other attributes: the equal pitch
contours (Fig. 23, p. 71), and the equal loudness contours
(Fig. 45, p. 124). The observer was given alternately tones of
different frequency, and he varied the intensity of one until it
equaled the other with respect to volume. In other words, it is
possible to make two tones appear equal in volume when they
are obviously different in both pitch and loudness. This result
is achieved by making the higher tone more intense than the
lower tone. Therefore, we must conclude that the volume
of a tone increases with intensity and decreases with frequency.

Equal volume contours, covering a limited range of
frequency and intensity, are shown in Fig. 68. The slope of these
contours changes with sensation-level in such a way as to
indicate that, at high intensities, the relative effectiveness of intensity
is greater than that of frequency as a determiner of volume.
The reverse is true at low intensities.

Fig. 68. Equal volume contours, showing how a difference in frequency
can be offset by a difference in intensity in order to keep volume constant.
The change in intensity is different at different sensation levels (parameter).
These curves were drawn from values obtained by subtracting the intensity
at which a tone appeared equal in volume to the standard tone of 900 cycles
(zero on the abscissa) from the value at which it appeared equal to the
standard in loudness (After Stevens, 2).

The contours of Fig. 68 do little more than establish the
possibility of making volumic equations. To this extent they
justify the concept of tonal volume, but they leave much to be
desired. We should like to know the form of such contours
over wider ranges of frequency and intensity, as we do for
pitch and loudness. We should like to establish the form of a
subjective scale for volume analogous to those for pitch (Fig. 26,
p. 81) and for loudness (Fig. 43, p. 118). There is also the
difficulty with the present volume-contours that they are slightly
sigmoid in form, a fact which means that, when two tones are
each equated to a third tone in volume, they may be found to
be not quite equal to each other. Systematic errors, arising
from faulty experimental conditions, must be blamed for this
discrepancy.

A different sort of experiment was performed by Békésy (9),
who estimated directly the apparent diameter of a sound
generator placed some distance away. The apparent diameter
increased from about 0.9 to 15 meters as the intensity of an
800-cycle tone was raised from 10 to 60 db above threshold. At
a given intensity a 100-cycle tone appeared larger than the
800-cycle tone. These results show that volume can be readily
discriminated under other conditions than those used to obtain
the contours of Fig. 68.


DENSITY 

The tendency of observers to characterize some tones as being
more ‘tight,’ ‘hard,’ ‘compact,’ or ‘dense’ than others led to an
experimental investigation of the possible existence of a fourth
attribute of tones (Stevens, 3). Again, observers were
presented alternately with two tones of different frequency and
allowed to change the intensity of one of them until the two
tones appeared equal in density. The meaning of the concept
(cf. Stevens, 6) was illustrated to the uninitiated observers by
presenting them with a high tone (4000 cycles), followed by a
low tone (200 cycles). The observers quickly recognized the
dense compactness of the high tone as contrasted with the
diffuseness of the low tone. When the intensity of a tone was
increased, they noticed that the density tended also to increase.

An equal-density contour is shown in Fig. 69. Density is
established as a discriminable aspect of tones, distinct from
pitch, loudness, and volume, by the nature of this isophonic
contour. Here, the curve is plotted from the differences
between the values at which two tones are matched in density
and the values at which they are matched in loudness, so that
the observers could not have confused density with loudness.
The fact that they did not confuse density and volume is
apparent from the negative slope of this contour as contrasted
with the positive slope of Fig. 68. The equal pitch contours
are likewise obviously different. The equal-density contour
means that for two tones to be equal in density the lower tone
must be more intense.

Fig. 69. Equal-density contour showing how a difference in frequency can
be offset by a difference in intensity in order to keep density constant. The
standard tone was 500 cycles (zero on the abscissa) at 60 db above threshold
(zero on the ordinate) (After Stevens, 3).

Here, as in volume, we have done little more than
demonstrate the existence of a tonal attribute. An unpublished
experiment in the Harvard Laboratory has confirmed the form of the
density contour for the particular frequencies and intensities
employed, but we should like to know more, in a quantitative
way, than the general fact that density increases with frequency
and also with intensity.

The physiological basis of density and volume is not obvious.
One would like to think of density as related to the density of
nervous excitation at the cortex. Volume, then, might depend
upon the spread of this excitation, as Boring (1) suggested, and
loudness might depend upon the total amount. Pitch appears
to be related to its location. Such a situation is conceivable, but,
in view of our present state of knowledge, it is perhaps wiser
to limit ourselves to less speculative assertions. Besides, such
charming simplicity in psychophysiological relations is scarcely
to be hoped for. We can say, however, that the interaction of
a sound wave with the auditory mechanism of a subject who
has been instructed to observe tonal density sets up a pattern of
neural excitation which is differentiated so as to issue in a
discriminatory response. Since the response when the instruction
is to judge density is different from the response which occurs
when the subject is set to observe pitch, loudness, or volume, it
must follow that the total neural event is patterned differently
in each case, although the precise nature of the neural pattern
and the manner of its arousal by the cochlear mechanism remain
obscure.


THE QUESTION OF BRIGHTNESS 
The previous discussion has considered only the problem of
the attributes of pure tones. The candidacy of any other
attribute for recognition as an attribute of pure tones can be
successful only when it can be shown that the new attribute is a
different function of frequency and intensity from those of
pitch, loudness, volume, and density.

Brightness has probably been the most persistent claimant for
recognition. It is well known that tones can be characterized
as bright or dull, but this aspect of tones has been said to be so
closely correlated with pitch that the two form a single
dimension (Troland, 1). Nevertheless, Abraham thought he had
demonstrated the independence of pitch and brightness by
producing, on a Seebeck siren, tones having the same fundamental
frequency, but differing in brightness. He concluded that
brightness is a function of the ratio of the size of the hole in
the siren disk to the size of the closed interval between the holes.
An analysis, by means of an electrical wave analyzer, shows
that the difference between Abraham’s tones is a matter of the
proportion of higher partials present (Boring and Stevens).
Furthermore, the brighter tone is the one with the larger
proportion of higher partials present. Further investigation
showed that observers agreed in calling the louder of two
complex tones the brighter. In other words, brightness was found
to be a joint function of the frequency of the dominant
components in a tone and the intensity of the tone. Density, as we
have already seen, depends upon frequency and intensity in the
same way. This fact suggests the conclusion that density and
brightness may be two words for the same attribute.

This conclusion receives further support from the fact that 
an effort to get observers to equate two pure tones in brightness 
is unsuccessful when the observers are told that brightness is 
something different from density. On the other hand, ob- 
servers who are unfamiliar with density are able to make con- 
sistent judgments of brightness, and for them brightness turns 
out to increase with both intensity and frequency, according to 
the relation which has, with other observers, been established 
for density. It seems, therefore, that brightness and density 
vary together to such an extent that the two attributes ought, 
at least for the present, to be considered identical. 



CHAPTER 6 

AUDITORY LOCALIZATION 

People can normally locate with surprising accuracy the source
of the sounds they hear. This phenomenon, coupled with the
fact that sound does not travel in a straight line nor produce
sharp shadows when it passes objects, has led to much
experimental research designed to explain our ability to localize. The
analogous problem in vision is less puzzling, because one does
not see a source of light unless the eyes are turned toward it.
Sounds reach the ears, regardless of the direction from which
they come, and the cues by which the observer can detect the
location of the source are subtle indeed.

Practically all theories of sound localization start from the
assumption that the listener observes certain characteristics of
the sound which, as he perceives them, vary with the position
of the source. Then, by comparing these cues with information
derived from past experience with sound sources, he makes a
judgment about the location of the source. In order
completely to fix the position of the source, he must be able to assign
to it three coordinates: its distance and some two angles
defining its direction. He can fix the sound only when he can
observe at least three of its independent properties which are
functions of its position. If fewer than three properties are
available, some indeterminateness in localization is certain to
arise. If more than three are available, the listener has the
possibility of increasing the certainty of his judgment through the
use of the additional cues.

Now, these theoretical considerations apply when the
observer attempts to locate the source, as regards both its direction
and its distance. However, such complete localization of tones,
especially of pure tones, is very difficult. The placement of the
ears, on opposite sides of the head, often provides an adequate
cue for right-left localization, although only meager evidence
for the other coordinates. Consequently, most research has
been centered upon the problem of directional localization in
the horizontal plane, where the cues consist of differences in
intensity and phase (or time of arrival) of the sound at the two
ears. Especially has much experimentation sought to evaluate
the roles of phase, intensity, and time in sound localization, by
means of dichotic stimulation, i.e., by leading to the two ears
separately sounds differing in intensity, phase, or time of arrival.
Many experimenters have sought to determine the relative
merits of the ‘intensity theory,’ the ‘phase theory,’ and the ‘time
theory.’ Each of these factors (difference of intensity, of phase,
or of time) influences localization, each has been nominated
by one or more experimenters as the most important factor in
localization, and each has been reduced, in theory, to one of
the others.


THE ROLE OF INTENSITY 
When two tones, differing only in intensity, are led
separately to each ear, the listener tends to imagine the source as
located toward the side of the greater intensity. It should be
emphasized, however, that this form of stimulation corresponds,
in general, to no actual source of sound. An actual source
would produce differences of phase as well as of intensity.
Nevertheless, the tendency for the imagined source to shift to
the side of the greater intensity is often very compelling.

Fig. 70. The difference in loudness-level produced in the right ear when a
source of pure tone is moved from the right to the left of an observer. The
curve shows how marked is the sound-shadow produced at different
frequencies by the head (Steinberg and Snow).

When an actual source of sound is situated at the side of an 
observer, a difference of mtensity occurs at the two ears, for 
one ear finds itself located in the shadow of the head In a 
tone, the sharpness of this shadow is a function of frequency 
Very low tones produce practically no shadows, but, when the 
frequency is greater than 5000 cycles, the difference in loudness- 
level at the two ears may be as great as 30 db Figure 70 
shows the difference in loudness-level at the two ears when 



Fig. 71 The variation in loudness-level as a speech-source is totaled in a 
horizontal plane around the head The dotted curve shows the difference 
in loudness-level between the two ears for various azimuths (Steinberg and 
Snow) 

tones of various frequencies come directly from the side (azimuth
of 90°). The difference may be even greater for other
azimuths (Sivian and White). It is clear from Fig. 70 that,
when the sound is complex, such as speech or music, not only
is there a difference of intensity at the two ears, but a difference







in composition as well. The high-frequency components are
lost to the ear on the far side of the head.

Calculation of the average loudness-level of speech at the
two ears, for various positions of the source, gave the results
shown in Fig. 71 (Steinberg and Snow). At 42° and 137°
the difference in total loudness is the same, but the quality of
the speech is different at the two ears, because the function
relating loudness to azimuth is not the same at all frequencies.
In the first place, the ears are not diametrically opposite, but
are about 165° apart, and, in the second place, the external



Fig. 72. Showing the amount (ΔI) by which a tone (800 cycles) in one
ear must be made more intense than the same tone in the other ear in order
to produce a just noticeable shift of the apparent source from the median
plane. (After Upton.)


ear flap casts a noticeable shadow at high frequencies, with the
result that sounds originating behind the head suffer a different
distortion from those originating in front.

In the laboratory situation, where tones differing only in
intensity fall on the two ears separately, the apparent source is
displaced laterally whenever the difference exceeds a certain
threshold value. Fig. 72 shows how large this difference must
be (Upton). The value ΔI was found by measuring the






amount by which the intensity of an 800-cycle tone in one ear
had to be increased to cause a shift of the apparent source of
sound from the median position. These results represent a
certain type of differential sensitivity; they should be compared
with the curves in Fig. 54 (p. 138). The smallest ratio of ΔI
to I obtained by Upton’s dichotic method is about twice as large
as that for the monaural method used by Riesz, when the
intensity, in both cases, is stated in terms of the energy of the
stimulus. (Care must be taken, in comparing the ratio ΔI/I
in different experiments, to be sure that the units in which I
is measured are the same. Thus the values reported by Upton
need to be transformed from units of pressure to units of energy
before they can be compared with those of Riesz, because energy
is proportional to the square of the pressure.)
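
The transformation from units of pressure to units of energy can be sketched numerically. Because energy is proportional to the square of the pressure, a just noticeable pressure increment Δp/p corresponds to an energy increment of (1 + Δp/p)² − 1. The function names and the 10 per cent sample value below are illustrative assumptions; they are not figures taken from Upton or from Riesz.

```python
import math


def energy_ratio_from_pressure_ratio(dp_over_p: float) -> float:
    """Convert a relative pressure increment (dp/p) into the equivalent
    relative energy increment (dI/I), using I proportional to p squared."""
    return (1.0 + dp_over_p) ** 2 - 1.0


def increment_in_db(dp_over_p: float) -> float:
    """Express the same increment as a level difference in decibels."""
    return 20.0 * math.log10(1.0 + dp_over_p)


# Hypothetical pressure increment of 10 per cent (not a measured value).
example = 0.10
print(energy_ratio_from_pressure_ratio(example))  # about 0.21
print(increment_in_db(example))
```

For small increments the energy ratio is nearly twice the pressure ratio, which is why two sets of results stated in different units can appear to disagree by a factor of about two.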

As the difference in intensity at the two ears is increased
beyond the minimal value necessary to displace the sound from
the median plane, the apparent source moves farther to the
side. Stewart (3) reported that the angular displacement of
the apparent source is proportional to the difference, in decibels,
between the intensities in the two ears. However, it is
improbable that such a simple relation obtains for all frequencies
and intensities. Some of Stewart’s observers showed great
variability in their results, and with some of them the
intensity-effect was completely absent.

THE ROLE OF PHASE 

When two tones, differing only in phase, are led one to each
ear, the listener tends to image the source as located toward the
side of the leading phase. A phase-difference means that the
crest of one sound wave arrives at its receptor before or after
the crest of the other wave.

Clearly, if a sound wave comes from the side, the crest of a
particular wave reaches the nearer ear before it reaches the ear
on the far side of the head. But, when the sound is a continuous
tone consisting of successive waves, the situation may become
ambiguous, for when the phase at one ear leads the phase at
the other ear by more than 180° it can be said no longer to lead,





but to lag. This situation may arise whenever the difference in
the length of the path to the two ears is greater than half of the
wave length of the sound, for then there is a position of the
source on both sides of the head which will give the same
phase-difference at the two ears. Thereupon, phase becomes an
ambiguous cue for localization.

These considerations indicate that, for high frequencies,
where the wave length is short, localization based on
phase-differences should break down. The critical frequency for
this break-down can be readily calculated. It is the frequency
whose wave length is just twice the distance between the ears,
which are about 21 cm apart. This frequency turns out to be
about 800 cycles. With any higher frequency there are
positions both on the right and the left sides of the head at which
the source could be placed to yield identical phase-differences.
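
The calculation of the critical frequency can be sketched as follows. The distance of 21 cm is taken from the text; the velocity of sound (34,400 cm per second) is an assumed round value for air at room temperature, and the function name is merely illustrative.

```python
SPEED_OF_SOUND_CM_PER_S = 34_400.0  # assumed value for air at room temperature


def critical_frequency(ear_separation_cm: float,
                       speed_cm_per_s: float = SPEED_OF_SOUND_CM_PER_S) -> float:
    """Frequency whose wave length is just twice the distance between the ears."""
    wavelength_cm = 2.0 * ear_separation_cm
    return speed_cm_per_s / wavelength_cm


print(round(critical_frequency(21.0)))  # 819, i.e. roughly 800 cycles
```

A somewhat different assumed velocity or ear separation shifts the result by a few per cent, which is consistent with the text's "about 800 cycles."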

Much effort has been expended in trying to evaluate the
effects of phase by leading tones of different phase to the two
ears. Although it is possible to obtain lateral displacement of
the apparent source when the phase at one ear is advanced or
retarded, the results of these experiments show great
inconsistencies. Typical is the finding that some observers show a
‘phase-effect’ whereas others show no effect at all, even under
identical conditions (Wightman and Firestone). When
listening to binaural beats, where the phase relations at the two ears
undergo continuous change, most observers notice no apparent
shift in localization unless the change is suggested to them.
Thus it appears that the listener’s attitude is of great importance
(Valentine).

The upper limit in frequency at which the ‘phase-effect’ is
detectable has been placed at values ranging from 512 cycles to
17,000 cycles (Trimble, 1). The fact that the refractory period
of the fibers in the auditory nerve limits the frequency at which
each fiber can respond to each sound wave suggests that the
‘phase-effect’ should cease at frequencies greater than about 800
or 1000 cycles (see p. 398). At any rate, a similar prediction
appears to hold true regarding binaural beats, which are not
detectable above that range of frequencies (Stevens and Sobel).





THE ROLE OF TIME 

As already pointed out, phase-difference means that the
crest of a sound wave arrives at one ear before it arrives at the
other. When the sound consists of a single sharp sound wave,
such as a click, we customarily speak of its time of arrival at
the two ears rather than its phase. When two sound impulses
differ in time at the two ears by the proper amount, the apparent
source tends to shift to the side of the first arrival. There is,
of course, a minimal value of the time difference below which
no displacement occurs. And there is another value above
which the sound breaks up and appears double, one sound
at each ear. The lower value is of the order of 0.1 msec and
the upper value is of the order of 2 msec, although some authors
have reported values quite different from these (cf. Trimble,
2). Between these values, the apparent displacement of the
sound source is roughly proportional to the time difference
(Trimble, 3; Békésy, 5).
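
The three regions of time difference just described can be summarized in a short sketch. The two limits are the orders of magnitude quoted in the text, not exact thresholds; the sign convention (positive when the right ear receives the click first) and the wording of the returned labels are assumptions made for illustration.

```python
LOWER_LIMIT_MSEC = 0.1  # order of magnitude quoted in the text
UPPER_LIMIT_MSEC = 2.0  # order of magnitude quoted in the text


def click_percept(time_difference_msec: float) -> str:
    """Classify what is heard when a click reaches the two ears at
    slightly different times (positive = right ear first, by assumption)."""
    dt = abs(time_difference_msec)
    if dt < LOWER_LIMIT_MSEC:
        return "single sound in the median plane"
    if dt > UPPER_LIMIT_MSEC:
        return "two sounds, one at each ear"
    side = "right" if time_difference_msec > 0 else "left"
    return f"single sound displaced toward the {side}"


for dt in (0.05, 0.5, -0.5, 3.0):
    print(dt, "->", click_percept(dt))
```

Within the middle region the text states only that displacement is roughly proportional to the time difference; no constant of proportionality is given, so none is assumed here.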

Under the proper conditions, when a difference of intensity
is opposed to a temporal difference in the two ears, the two
tendencies to lateral localization may cancel one another and
leave the apparent source of sound in the median plane
(Trimble, 4; see also p. 427).

THE CUES TO DISTANCE 

We have seen that, for the lateral localization of tones,
differences in intensity and phase at the two ears provide serviceable
cues; for clicks the cues are differences in intensity and time.
What are the cues by which we judge the distance of a source of
sound?

The total intensity of a sound is a cue to its distance, provided
the sound is a familiar one. However, more interesting cues,
theoretically, are those based on the combined intensity and
phase relations at the two ears. These relations change with
the distance of the source from the observer, even when the
direction of the source remains the same. The effect of distance
can be conveniently measured with a man-shaped dummy
whose ears have been replaced by microphones. Figure 73





shows the amplitude ratio (measured in terms of sound pressure)
and phase difference at the two ears of such a dummy.
The stimulus is a 256-cycle tone, placed at the direction or
azimuth indicated by the abscissae, and at distances from the
center of the head as marked on the curves (Wightman and



Fig. 73. Showing the differences in intensity and in phase produced at the
two ears when a 256-cycle tone is sounded at various azimuths and distances.
(After Wightman and Firestone.)


Firestone). These curves agree well with the curves that
Hartley and Fry computed on the assumption that the head
is a rigid sphere.

The phase-difference depends upon the azimuth, but its
dependence on the distance of the source is slight, and, owing
to the random reflection of sounds from the shoulders, it is
erratic. There is, however, a less ambiguous relation between
the amplitude ratio and the distance of the source. This ratio
depends on both the distance and the azimuth. Therefore, we
might expect theoretically that, with the knowledge of
phase-difference to determine the direction of the source, the additional




knowledge of the amplitude ratio would enable one to determine
the distance of the source. Even without a knowledge of
the direction, a knowledge of the amplitude ratio should enable
a certain limiting distance to be fixed. At this frequency of
256 cycles, for instance, an amplitude ratio of 0.40 would
indicate that the source is certainly within 50 cm of the head. The
curves show that this method of determining the distance of a
source can be accurate only when the source is within about
100 cm of the head and at the side, for it is only then that the
amplitude ratio changes appreciably with distance.
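
Why the amplitude ratio is informative only for near sources can be illustrated with a deliberately crude sketch. It is not the rigid-sphere computation of Hartley and Fry, nor the dummy measurements of Fig. 73: it assumes a point source at an azimuth of 90°, two ears 21 cm apart in free space with no head between them, and pressure falling off inversely with distance. The values it prints are therefore not those of Fig. 73; only the trend is of interest.

```python
HALF_EAR_SEPARATION_CM = 10.5  # half of the 21 cm quoted under The Role of Phase


def free_field_amplitude_ratio(distance_cm: float) -> float:
    """Far-ear / near-ear pressure ratio for a source directly at the side,
    using the inverse-distance law only (assumption: no head, no diffraction)."""
    if distance_cm <= HALF_EAR_SEPARATION_CM:
        raise ValueError("source must lie outside the head")
    near_cm = distance_cm - HALF_EAR_SEPARATION_CM
    far_cm = distance_cm + HALF_EAR_SEPARATION_CM
    return near_cm / far_cm


# Distance is measured from the center of the head, as in the text.
for d in (25, 50, 100, 200, 400):
    print(d, "cm:", round(free_field_amplitude_ratio(d), 2))
```

Even in this simplified picture the ratio changes rapidly for sources close to the head and approaches unity, changing very little, once the source is a meter or two away.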

Wightman and Firestone presented 256-cycle tones, differing
in amplitude and phase at the two ears, and investigated the
accuracy with which naive observers could localize them as to
direction and distance. Some of the observers were consistent
in judging azimuth, but none of them was able consistently to
estimate distance. Hence, it appears that observers are not, in
general, able to utilize the slight cues with which they might
judge distance from differences in phase and intensity at the
two ears.

THE LOCALIZATION OF ACTUAL SOURCES 
The previous discussion has dealt with the problem of
auditory localization by analyzing the factors which provide
cues as to the position of the source. We have considered the
ability of a listener to assign an apparent direction to a sound
when it is led to his ears dichotically, but there remains the
problem of the accuracy with which actual sounds in free space
can be localized. It is true that this latter problem was,
historically, the first to be investigated, but interest in it was
largely eclipsed by the study of dichotic stimulation under
laboratory conditions.

The earliest systematic investigations of localization were
carried out by means of ‘sound cages.’ These were convenient
devices for holding a source of sound at any desired position
about the head of a subject, whose task it was to indicate the
direction from which the sound came. The results of these
investigations were limited by the fact that, owing to the lack





of electrical generating apparatus, it was necessary, for the most
part, to use clicks and noises as stimuli. Another limiting
factor was the custom of experimenting in closed rooms whose
walls were not sound-absorbent, and which reflected sound
from many directions at once. In spite of these drawbacks,
certain facts were early established (Pierce). Observers are able
(1) to locate noises better than tones, and (2) to distinguish
right from left with great accuracy. However, they tend (3)
to confuse the location of sounds lying in the median plane, and
(4) to distinguish with the least accuracy small changes in the
azimuth of sounds coming directly from the sides. This last
finding is predictable from the curves of Fig. 73. Near the
azimuth of 90° there is a range of about 30° throughout which
the phase and intensity relations at the two ears change
practically not at all. Within this range all cues for localization
must appear alike to the listener.

One condition necessary to a successful study of the ability
of a listener to locate an actual source of pure tone is that all
the tone should reach the ears directly from the source and none
of it from reflecting surfaces. An attempt to satisfy this
condition was made by seating the observer in a tall swivel-chair
on top of a high ventilator rising above the roof of a building
(Stevens and Newman, 1, 2). The tones were generated in a
loud speaker mounted on the end of a 12-ft arm attached to
the pedestal of the chair. In this way, the source could be
moved noiselessly in a complete circle at the level of the
observer’s ears. Since confusion between right and left seldom
occurs, the subject of the experiment was required to name from
which of thirteen positions, spaced 15° apart on his right side,
the tone appeared to emanate. The average of the errors made
by two observers is plotted in Fig. 74, plot A. (It was not
counted as an error if the listener confused positions in the front
quadrant with those in the rear quadrant.) The errors are
relatively constant at low frequencies, but become definitely
larger as the frequency of the tone approaches 3000 cycles.
Above 4000 cycles, however, localization improves again and is
quite as accurate at 10,000 as at 1000 cycles.




Fig. 74. Plot A shows the average of the errors, in degrees, made by two
observers in localizing a source of tone at various frequencies. Circles and
crosses are for two different series of observations. Triangles are for impure
tones.

Plot B shows the absence of phase-effect at high frequencies and of
intensity-effect at low frequencies. The solid curve represents theoretically the maximal
angle by which a tone can be displaced from the median plane by 180° change
in phase. The circles on the dotted curve represent what Halverson (4)
reported as the observed limit of displacement. The dot-dash curve represents
the observed difference in intensity at the two ears of tones originating at the
side of the head (Sivian and White).

Plot C shows the percentage of confusions between front and rear quadrants.
(Stevens and Newman, 2.)






The explanation of the shape of the curve in plot A must
concern us. The inexact localization of tones between 2000
and 4000 cycles is precisely what we should expect from a
consideration of the effects of the two localizing factors, difference
in phase and in intensity. Owing to the size and shape of the
head, there are, as already pointed out, theoretical limits to the
possible effectiveness of each of these factors. These limits are
shown graphically in plot B of Fig. 74. It is well established
that phase-difference is most effective in determining the
apparent location of low tones, and that, above some frequency
in the neighborhood of 800 cycles, its effectiveness decreases
with increasing frequency. The solid curve in plot B represents,
theoretically, a first approximation to what is the maximal
lateral shift in localization obtainable with a phase-difference
of 180°. In other words, it shows how far from the median
plane a tone of a given frequency would have to be to produce
a phase-difference of 180°. The dotted line shows the results
reported by Halverson (4) when he measured the apparent
displacement of the source by leading tones to the two ears 180°
out of phase. Clearly, then, phase is an effective cue for
localizing low tones, but is ineffective for high tones.
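
A theoretical limit of the kind drawn as the solid curve of plot B can be sketched under a simple geometrical assumption, namely that the difference in path to the two ears equals the ear separation multiplied by the sine of the azimuth. This is one possible first approximation, not necessarily the one used for the published curve; the velocity of sound and the function name are likewise assumptions.

```python
import math


def max_phase_displacement_deg(frequency_hz: float,
                               ear_separation_cm: float = 21.0,
                               speed_cm_per_s: float = 34_400.0) -> float:
    """Azimuth from the median plane at which the interaural path difference
    equals half a wave length (a phase-difference of 180 degrees).
    Assumes path difference = ear_separation * sin(azimuth)."""
    half_wavelength_cm = speed_cm_per_s / frequency_hz / 2.0
    ratio = half_wavelength_cm / ear_separation_cm
    if ratio >= 1.0:
        # Below the critical frequency 180 degrees is never reached,
        # so phase remains unambiguous out to the full 90 degrees.
        return 90.0
    return math.degrees(math.asin(ratio))


for f in (500, 1000, 2000, 4000, 8000):
    print(f, "cycles:", round(max_phase_displacement_deg(f), 1), "degrees")
```

On these assumptions the usable angle stays at 90° for low tones and shrinks steadily above the critical frequency, which is the general form the text ascribes to the solid curve.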

It should be noted in passing that Halverson’s results are
in conflict with the results of Stevens and Sobel, who failed to
detect binaural beats at frequencies above 800 cycles. Indeed,
it is difficult to conceive how phase-differences could possibly
produce displacements of the apparent source at very high
frequencies, in view of the fact that the ability of the impulses in the
auditory nerve to synchronize with the frequency of the stimulus
breaks down at high frequencies (see p. 394 and p. 421). It
may be that Halverson’s results were due to changes of intensity
rather than of phase.

The dot-dash curve of plot B represents the observed
differences in intensity at the two ears of tones originating at the
side of the observer (see Fig. 70). These differences are small
at low frequencies, but above 4000 cycles they increase rapidly.
In other words, relative intensity provides a good cue for
localizing high tones, but not for low tones. In the region near





3000 cycles, neither relative intensity nor phase offers very
adequate cues, and it is precisely in this region that the errors
of localization are greatest, as shown in plot A.

Another interesting difference in the localizability of high
and low tones occurs in the matter of front-back discrimination.
Plot C of Fig. 74 shows the percentage of confusions of the
front-back quadrants which occurred in the experiment we are
considering. It is apparent at once that the total range of
frequencies is divided into two distinct regions separated by a
critical region near 3000 cycles. For low tones, where
localization is based on phase-differences, discrimination between the
front and back quadrants is only a little better than chance.
Above 4000 cycles the number is but one third of those expected
by chance. Apparently the ability to distinguish front from
back in high tones is due to a difference in intensity between
sounds in front and behind.

A number of checks were made to validate this notion. A
continuous tone of 10,000 cycles, when swung in a circle
completely around the listener, appeared much weaker behind the
listener than in front of him. Then, a number of tests were
made in which the actual intensity of the tones was varied from
trial to trial, with the result that the number of confusions
increased over that usually found for low tones. It appears that
the observer is able to form a subjective standard of intensity in
a very few trials, and afterwards the tones heard behind are
the weak ones and those heard in front are the strong ones.
Sound shadows from the external ear must account for this
effect.

In view of these findings, it is no longer surprising that
complex tones and noises can be localized with relative ease.
When both low and high frequencies are present as components
in a sound, the low frequencies provide cues in the form of
phase-differences and the high frequencies provide cues in the
form of intensitive differences, and the two types of cues render
each other mutual support. In addition, the attenuation of the
high frequencies, in sounds coming from behind the listener,
changes their quality as well as their loudness. The result is





an accuracy of localization greater than that obtainable with
pure tones.

The remarkable effectiveness of changes in the quality of
complex sounds for localization when the direction of the source
changes is shown by the ability of a person, deaf in one ear, to
localize familiar complex sounds. That these changes actually
contribute to sound localization is supported by experimental
evidence. In fact, for complex sounds, the accuracy does not
differ greatly when the localization is made monaurally instead
of binaurally (Angell and Fite).

THE FACTOR OF MOVEMENT 

Our concern thus far has been with static localization, i.e.,
the localizability of tones coming from a fixed direction. When
no relative movement occurs between the observer and the
source, the ability of the observer is, for the most part, limited
to the designation of how many degrees a sound is from the
median plane, and localization relative to other planes is
extremely difficult, especially for low tones. However, most
actual cases of localization involve an additional dynamic factor
of movement. When we go hunting for a songbird whose
music attracts us, we are free to move our heads, and thereby
add materially to our cues for localization.

The effectiveness of movement can be illustrated by
considering the simple case in which the listener is allowed to move
his head from right to left in the horizontal plane. He should
then be able to tell front from back in the median plane, for, if
the tone is in front and he turns to the left, the tone will appear
to be on the right side of his head, whereas, if he turns to the
right, the tone will appear on the left side. The opposite would
be true if the tone were behind him. If the tone were directly
overhead, moving the head would not alter the relative phases
or intensities at the ears, and this fact would be the cue to the
location of the source. Similarly, other positions of the source
would produce binaural differences which movement of the
head would alter in some characteristic fashion.
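
The front-back argument for a source in the median plane can be put in the form of a small geometrical sketch. The conventions are assumptions made for illustration: azimuth is measured clockwise from straight ahead (0° in front, 90° to the right, 180° behind), a positive head turn is a turn to the right, and the sign of the sine of the relative azimuth is taken to indicate the side on which the tone appears. The overhead case, in which a turn changes nothing, is not modeled.

```python
import math


def apparent_side(source_azimuth_deg: float, head_turn_deg: float) -> str:
    """Side of the head on which a source appears after the head is turned.
    Azimuth clockwise from straight ahead; positive turn = to the right."""
    relative = math.radians(source_azimuth_deg - head_turn_deg)
    lateral = math.sin(relative)
    if abs(lateral) < 1e-9:
        return "median"
    return "right" if lateral > 0 else "left"


def front_or_back_after_left_turn(side: str) -> str:
    """Infer front or back for a median-plane source from the side on which
    it appears after a small turn of the head to the left."""
    return {"right": "front", "left": "back"}.get(side, "undetermined")


turn_left = -20.0  # hypothetical head turn of 20 degrees to the left
for azimuth in (0.0, 180.0):
    side = apparent_side(azimuth, turn_left)
    print(azimuth, "->", side, "->", front_or_back_after_left_turn(side))
```

With the head held still both sources give "median," which is the ambiguity that the movement removes.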





THE STEREOPHONIC EFFECT 
A novel attack on the problem of localization, one that has
only recently been initiated, is the investigation of auditory
perspective: the stereophonic effect of multiple sources of sound.
When listening directly to an orchestral production, the
audience senses the spatial relation of the various instruments of
the orchestra. This spatial character of the sounds gives to the
music a characteristic of depth and extensiveness. Ideally this
auditory perspective should be preserved when the music is
reproduced (by radio broadcast, for example), but when a
single microphone is used to pick up the music, the possibility
of re-establishing the binaural differences which would make
for perspective in the reproduction is lost.

There are two ways of reproducing sounds in true auditory
perspective. One is binaural reproduction, in which there is
led to the observer’s ears, by means of earphones, an exact copy
of the sound waves which would stimulate his two ears if he
were listening directly. We can do this conveniently by picking
up the sound with two microphones, placed in the position
of the ears on a man-shaped dummy, and connecting one earphone
to the amplified output of each microphone. Then, if
someone walks around the dummy, talking as he goes, a person
wearing the earphones has a compelling illusion of someone
walking around him. The other method uses two or more
microphones and a corresponding number of loud speakers,
and aims to reproduce in a second room an exact copy of the
pattern of sound vibration that exists in the original room.
Ideally, an infinite number of microphones and loud speakers
of infinitesimal dimensions would be needed to make the
reproduction perfect, but, in practice, as few as two
microphone-loud speaker combinations (channels) have been found to give
fair auditory perspective. Extensive tests were carried out with
two and three channels in various combinations in order to
determine the adequacy of such methods (Steinberg and Snow).
Figure 75 shows a diagram of the experimental set-up and the
results obtained. The microphones were set on a ‘pick-up’





stage and the loud speakers were placed at the front end of an
auditorium, behind a curtain of theatrical gauze. The average
position of a group of twelve observers is indicated by the cross
in the rear part of the auditorium. These observers were asked



Fig. 75. Diagram of arrangement (left) for tests of the stereophonic effect,
and (right) the results obtained. (Steinberg and Snow.)


to indicate from what point behind the curtain the sound
(speech) appeared to come, and their judgments were compared
with the actual positions on the ‘pick-up’ stage.

With three-channel reproduction, there is reasonably good
correspondence between the caller’s actual position on the
‘pick-up’ stage and his apparent position on the virtual stage, both
as regards right and left, and front and back. Thus the system
affords depth as well as angular localization. For comparison,
there is shown in the last diagram the localization obtained by
direct listening. The crosses indicate the caller’s position
behind the curtain and the circles indicate his apparent position,
as judged by the observers listening to his speech directly.






With two-channel reproduction, the virtual stage tended to
appear wider and less deep than with three-channel
reproduction.

Steinberg and Snow showed that the accuracy of the angular
localization under these conditions can be accounted for by a
consideration solely of the loudness-differences at the two ears
of the observers. Indeed, it is difficult to see how
phase-differences could, in multi-channel reproduction, assist the
localization in any way. The angular location of each position on the
virtual stage results from a particular intensitive difference at
the two ears produced by the speech coming from the loud
speakers. The factors influencing depth localization are not,
however, so simply apparent. It has been shown that, even
with single-channel reproduction, an increase in the ratio of the
sound reaching the microphone directly to that reflected to the
microphone from the walls causes the sound to appear closer
to the listener. In other words, more reverberant sound is
heard when the source is farther away in the room (Maxfield).
This point is of practical importance to motion-picture
engineers.

If the quality of the sound from the various loud speakers
in the multi-channel arrangement differs noticeably, it has
important effects on localization. When the two-channel
microphones were so arranged that one picked up mostly direct and
the other mostly reverberant sound, the virtual source was
localized exactly at the ‘direct’ loud speaker, until the power
from the ‘reverberant’ loud speaker was from 8 to 10 db greater.
In general, localization tends toward the channel giving the
most natural reproduction, and this effect can be used to aid
the loudness-differences in producing angular localization.



CHAPTER 7 


AURAL HARMONICS AND
COMBINATION TONES

When the ear is stimulated by a pure tone, we hear, not only
that tone, but also a series of harmonics, or overtones, whose
frequencies are multiples of the frequency of the original tone.
Although traditionally these overtones have been called
‘subjective harmonics,’ the fact that they are generated by a physical
process in the ear itself makes it proper to refer to them as aural
harmonics. When, for example, a pure tone of 500 cycles is
sufficiently intense, a well-trained ear has no difficulty in
detecting a pitch corresponding to 1000 and to 1500 cycles. Likewise,
when two loud tones are sounded together, we hear, in addition
to these primaries, a group of combination tones made up of
frequencies which are the sums and differences of the
frequencies of the two primaries and of their several harmonics. We
shall see, in this chapter, to what extent it is possible to measure
these various tones, and to account for their presence in terms
of the nonlinearity and asymmetry of the auditory mechanism.
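
The frequencies at which combination tones may be expected can be listed mechanically from the rule just stated: sums and differences of the two primaries and of their harmonics. The sketch below enumerates only frequencies; it says nothing about which of these tones are actually audible or how strong they are. The primaries of 700 and 1000 cycles and the limit of two harmonics are arbitrary illustrative choices.

```python
def combination_tones(f1: float, f2: float, max_harmonic: int = 2) -> list:
    """Sum and difference frequencies m*f1 + n*f2 and |m*f1 - n*f2| for
    harmonic numbers m, n from 1 to max_harmonic, sorted, without duplicates."""
    tones = set()
    for m in range(1, max_harmonic + 1):
        for n in range(1, max_harmonic + 1):
            tones.add(m * f1 + n * f2)
            difference = abs(m * f1 - n * f2)
            if difference > 0:
                tones.add(difference)
    return sorted(tones)


# Hypothetical primaries, chosen only for illustration.
print(combination_tones(700, 1000))
# [300, 400, 600, 1300, 1700, 2400, 2700, 3400]
```

Raising `max_harmonic` lengthens the list rapidly, which accords with the large numbers of combination tones that can be demonstrated when two loud tones are sounded.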

INDIRECT MEASUREMENT OF AURAL 
HARMONICS 

It is obviously impracticable to measure the intensity of an
aural harmonic simply by listening to it. In fact, many times
listening does not even reveal the presence of the harmonics.
They can, however, be discovered and measured when an
auxiliary tone is introduced at a closely adjacent frequency and
allowed to beat with the aural harmonic. These beats usually
are noticeable even when the individual harmonic is obscured
by a louder fundamental. Thus, the fourth harmonic of a
500-cycle tone, 80 db above threshold, may not stand out by
itself, yet, when a frequency of 2003 cycles is introduced, faint
beats will be heard at the rate of 3 per second. Furthermore,


since beats are strongest when the intensities of the beating
tones are equal (see Chapter 9), we have reason to believe that
the most noticeable beating will occur when the intensity of
the 2003-cycle tone equals that of the aural harmonic.
Therefore, by adjusting the strength of the auxiliary tone until the
best beats are heard, we can obtain a fair indication of the
magnitude of the harmonic generated in the ear. Combination
tones also yield to this method of attack, and it is possible to
demonstrate the presence of large numbers of them when two
loud tones are sounded (Wegel and Lane).

Fig. 76 ... the fundamental (first harmonic) has an intensity as indicated.
Example: when the intensity-level of the first harmonic, or fundamental, is
100 db, the intensity-levels of the successive higher harmonics are 86, 73, 62,
52, 42, etc., db. (After Fletcher, 2.)
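
The arithmetic of the 500-cycle example can be written out as a short sketch: the beat rate is simply the difference between the frequency of the auxiliary tone and that of the harmonic being sought. The function name is illustrative only.

```python
def beat_rate(fundamental: float, harmonic_number: int, auxiliary: float) -> float:
    """Beats per second between an auxiliary tone and the n-th harmonic
    of a fundamental (all frequencies in cycles per second)."""
    return abs(auxiliary - harmonic_number * fundamental)


# Fourth harmonic of a 500-cycle tone against a 2003-cycle auxiliary tone.
print(beat_rate(500, 4, 2003))  # 3
```

In practice the auxiliary tone is set a few cycles away from the expected harmonic so that the beats are slow enough to be counted.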

Using this method of ‘best beats,’ Fletcher (2) was able to
construct the curves shown in Fig. 76. Here we have a set of




functions giving the relative intensities of all the harmonics
generated in the ear in response to stimulation by a pure tone (first
harmonic) of known intensity-level. In drawing these curves,
Fletcher made the reasonable assumption that the amount of
distortion imposed upon a tone during its transmission through
the middle ear is dependent only upon the intensity-level of the
sound, and not upon its sensation-level. Hence, the relative
size of the harmonics is independent of the frequency. In
utilizing these curves we must remember that they are somewhat
idealized, and that any individual case is likely to vary
considerably from these values. Nevertheless, Békésy (17)
made some measurements on a 200-cycle tone which agreed
very well with the prediction of Fig. 76.



Fig. 77 ... fundamental frequencies first appear as the intensity is raised from zero.
(After Fletcher, 1. Courtesy of D. Van Nostrand Company, Inc.)

One important consequence of the dependence of the size of
the aural harmonics on the intensity-level is seen when we
translate these intensity-levels into sensation-levels. Owing to
the form of the curve for the threshold of audition (Fig. 17,
p. 50), a given intensity-level represents different
sensation-levels at different frequencies. Hence, when we express the
magnitude of the aural harmonics of a 60-cycle tone in terms
of sensation-level, we find that, for a fundamental at an
intensity-level of 100 db, the sensation-levels of the first five
harmonics are 44, 46, 44, 38, and 30 db. The second harmonic
has a higher sensation-level than the first.
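
The translation between intensity-level and sensation-level amounts to subtracting the threshold at each frequency. The sketch below works backward from the two sets of figures quoted in the text (the intensity-levels from the example accompanying Fig. 76 and the sensation-levels for the 60-cycle tone) to the thresholds they imply. These implied thresholds are obtained here by subtraction alone; they are not read from the threshold curve of Fig. 17 and should be treated as an inference.

```python
# Harmonics 1 to 5 of a 60-cycle tone with the fundamental at 100 db.
harmonic_frequencies = [60 * n for n in range(1, 6)]  # 60, 120, ... 300 cycles
intensity_levels_db = [100, 86, 73, 62, 52]           # example quoted for Fig. 76
sensation_levels_db = [44, 46, 44, 38, 30]            # values quoted in the text


def implied_thresholds(intensity_db, sensation_db):
    """Threshold implied at each frequency: intensity-level minus sensation-level."""
    return [i - s for i, s in zip(intensity_db, sensation_db)]


thresholds = implied_thresholds(intensity_levels_db, sensation_levels_db)
for f, t in zip(harmonic_frequencies, thresholds):
    print(f, "cycles: implied threshold", t, "db")
```

The implied threshold falls steeply between 60 and 120 cycles, by more than the 14 db that separate the first two harmonics in intensity-level, and this is why the second harmonic ends with the higher sensation-level.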

Not only are the subjective effects of aural harmonics more
prominent for low tones than for high, but the harmonics first
make their appearance at a lower sensation-level when the
frequency of the fundamental is low. Figure 77 shows the
sensation-level of the fundamental at which the various harmonics
first become detectable (Fletcher, 1). In order to obtain these
curves, the pure fundamental tone was sounded at various levels
and the presence of the harmonic was determined by means of
an auxiliary tone which beat with the harmonic. For tones
above 1000 cycles, no harmonics are generated until the
sensation-level is about 50 db above threshold.

DIRECT MEASUREMENT OF AURAL HARMONICS 

Our approach to the problem of aural harmonics in human
ears is at best indirect, and the results of measurement are
variable and sometimes of equivocal significance. A more direct
attack is possible in the ears of animals, for there we can record
and analyze the electric potential generated in the cochlea as
a response to auditory stimulation. The nature of the cochlear
potentials is the subject of Chapters 13 and 14, but for
present purposes it is sufficient to point out that, whenever a
sound-wave enters the ear, it is transformed into an electric
wave of nearly the same form (The microphone used in radio 
tanadra&TKg pwfcsTna pits-Yidy sm 
sound into electrical energy ) The cochlear microphonics can 
be picked up by electrodes placed in contact with the exposed 
cochlea of an animal and amplified for purposes of study. 
Thus, with one electrode near the round window of the cochlea 
and the other in contact with some other part of the animal, we 
obtain a potential which may be taken as an index of the 
effective sound-energy reaching the end-organs of the auditory
mechanism. Then, with a wave-analyzer (Chapter 1), which
can be tuned to respond separately to each component of the 
electric potential, we can measure the amplitude of the funda- 
mental and of each aural harmonic as it exists within the cochlea 
(Stevens and Newman, 3). 

Typical results, obtained from a cat and from a guinea-pig,
are shown in Figs. 78 and 79 respectively. The ear was stimulated
by a pure tone, and the relative magnitude of the fundamental
and of each of the harmonics in the electric potential
picked up from the cochlea was measured by the wave-analyzer.
In each case the stimulus was a very pure tone of 1000 cycles,
whose intensity is represented along the abscissae. The magnitudes
of the different components of the cochlear microphonics
are plotted as ordinates against the stimulus-intensity.

Fig. 78. Analysis of the cochlear microphonics obtained from a cat's ear
when stimulated by a pure tone of 1000 cycles. Abscissa values represent the
intensity of the stimulus in decibels above the average human threshold. The
uppermost curve shows the magnitude of the fundamental frequency in the
cochlear microphonics, and the other curves are for the higher harmonics, as
indicated. (Stevens and Newman, 3.)

The fundamental has the greatest magnitude. It is represented
by the line connecting the solid black dots in the figures. It
will be seen that the fundamental increases almost linearly until
it is about 20 per cent of its maximal value, or about 15 db below
the maximal value. From this point on, an increase in the
strength of the stimulus gives a proportionately smaller and
smaller increase in the response of the cochlea, until finally the
maximum is reached. Any further increase in the stimulus
results possibly in an increase in the size of the harmonics, but
may soon lead to permanent injury of the ear.

Fig. 79. Analysis of the cochlear microphonics obtained from a guinea pig’s
ear. (Similar to Fig. 78.) (Stevens and Newman, 3.)
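The relation between a percentage of the maximal response and a number of decibels below the maximum can be checked with a short calculation. The sketch below is an editorial illustration, not part of the original measurements. It assumes that the percentages quoted for the cochlear potential are ratios of amplitude, so that the decibel value is 20 times the common logarithm of the ratio; the function names are hypothetical.

```python
import math

def amplitude_ratio_to_db(ratio):
    """Decibel value of an amplitude (potential) ratio: 20 * log10(ratio)."""
    return 20.0 * math.log10(ratio)

def db_to_amplitude_ratio(db):
    """Inverse conversion: amplitude ratio corresponding to a decibel value."""
    return 10.0 ** (db / 20.0)

# 20 per cent of the maximal value, expressed in decibels re the maximum.
twenty_percent_db = amplitude_ratio_to_db(0.20)
print(round(twenty_percent_db, 1))            # -14.0

# Amplitude ratio that lies exactly 15 db below the maximum.
print(round(db_to_amplitude_ratio(-15.0), 3)) # 0.178
```

On this assumption 20 per cent corresponds to roughly 14 db below the maximum, which agrees with the rounded figure of "about 15 db" given in the text.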

When, at any point along this curve, such as, for example, 
the 70-db sensation-level, the remainder of the potential is 
analyzed, we find present the second harmonic with a magni- 
tude, represented by the open circles in the figures, usually 15 to 
20 db below the fundamental; the third harmonic, represented 
by the crosses, 10 to 15 db lower; and the fourth and fifth har- 
monics still lower. The same analysis is repeated at the other 
sensation-levels. 

It should be noted at once that, below a sensation-level of
about 50 db, the magnitude of all aural harmonics is so small
that they lie below the limit of our ability to measure them, just
as they do in the human ear at this same frequency of 1000
cycles (Fig. 77). Above this level the harmonics increase with
an increase in the intensity of the stimulus, but they grow even
more rapidly than the response of the fundamental. They increase,
not only in absolute magnitude, but also in the relative
proportion that they are of the total response, a fact consistent
with the curves of Fig. 76. However, whereas the odd-numbered
harmonics, the third and the fifth, appear to reach a
maximum beyond which they fail to increase, the even harmonics,
the second and fourth, not only reach a maximum but
decline substantially at higher intensities, both in absolute and
in relative magnitude.

The results reported for 1000 cycles are thoroughly typical
of those obtained at other frequencies (Newman, Stevens, and
Davis). This fact is easily demonstrated at higher frequencies,
but the interpretation of experiments on lower frequencies requires
the evaluation of certain factors. In the first place, at
frequencies below 1000 cycles there is usually present a large
amount of disturbance created by the electrical activity of the
auditory nerve. These so-called action potentials tend to distort
the wave form of the cochlear microphonics and thereby introduce
spurious harmonics. In the second place, the sensitivity
of the ear is different at different frequencies, so that, as we have
just seen, there is a difference between the intensity level and
the sensation level of a sound. Since the potential generated
in the cochlea is a measure of the sensation level of the effective
stimulus, rather than of the intensity level, we obtain apparently
greater distortion at the low frequencies. In other words, at
low frequencies the harmonics are relatively more prominent,
because the ear is more sensitive to them than to the fundamental.


ODD VERSUS EVEN HARMONICS 

It is evident from Figs. 78 and 79 that the odd and the even
harmonics behave quite differently. Not only do the even
harmonics pass through a maximum as the intensity of the
stimulus is increased, but they also tend to show great variability
under experimental tests. The odd harmonics are generally
more stable and can be measured with greater reliability.

An important example of the ease with which the even harmonics
can be modified experimentally occurs in some measurements
on a guinea pig. This animal shows, under anesthesia,
a convenient disposition to contract spasmodically the muscles
in its middle ear. These contractions demonstrate that the
muscles are able to function, and there is reason to believe that,
between the contractions, the muscles maintain something like
normal tonicity. Additional anesthesia may abolish both the
contractions and the “normal” tonicity. What, then, is the
effect of tension of the muscles of the guinea pig’s ear on aural
harmonics? Figure 80 presents a comparison of the results
obtained, first, in a “normal” condition (i.e., while the tensor
was contracting occasionally), and, second, after the contractions
had ceased. The sizes of the harmonics under the condition
of tonus are shown by the solid lines, and the sizes under
relaxation by the dotted lines. It is clear that the size of the
second harmonic is markedly reduced, whereas the magnitude
of the third harmonic remains essentially unaltered. (Some of
this change in the second harmonic may have been due to the
behavior of action potentials, or to other factors difficult to
evaluate.)

Fig. 80. Showing how a change of tonus of the muscles of the middle ear
of a guinea pig affects the second and third harmonics. Each curve is an
average of values for stimuli of 1000, 1750, and 2500 cycles. The abscissa
gives the intensity of the stimulus referred, for each frequency, to the intensity
necessary for [illegible] per cent of the maximum response of the fundamental.
The dotted curves show the amount of harmonic remaining after the muscles
are relaxed. (Stevens and Newman, 3.)

With the cat, no such convenient spasm occurs in the middle
ear. However, the cat’s tensor tympani muscle can be exposed,
so that the tendinous attachment of the muscle can be cut at a
point near the eardrum. Results of this operation are shown in
Fig. 81. The solid curves represent again the normal condition,
whereas the broken lines are for values measured after the
muscle had been cut free. Here again the second harmonic has
been markedly altered, but the third harmonic exhibits its usual
stability.

Now, it is not entirely unexpected that the odd and the even
harmonics should be independently variable. In any transmission
system producing distortion, as in the middle ear, the
characteristic which gives rise to the even harmonics is separate
from that which produces the odd. A system which is nonlinear,
but symmetrical, generates only odd harmonics, as when
amplifying tubes are connected in ‘push-pull.’ A system which
is asymmetrical furnishes us with even harmonics. Consequently,
we are forced to conclude that the ear is both nonlinear and
asymmetrical, and that the degree of asymmetry is subject to
experimental control. These notions form the basis for the
discussion, in the next section, of the characteristic curve of the ear.
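The rule that a symmetrical nonlinear system yields only odd harmonics, whereas asymmetry introduces even ones, can be illustrated numerically. The following is a minimal sketch and not a model of the ear: the two polynomial curves and their coefficients are hypothetical, chosen only for illustration, and a discrete Fourier sum over one cycle stands in for the wave-analyzer.

```python
import math

def harmonic_amplitudes(curve, drive, n_harmonics=4, n_samples=256):
    """Amplitudes of harmonics 1..n_harmonics of curve(drive * sin(theta)).

    A plain discrete Fourier sum over one full cycle plays the part of
    the wave-analyzer.
    """
    amplitudes = []
    for k in range(1, n_harmonics + 1):
        re = im = 0.0
        for i in range(n_samples):
            theta = 2.0 * math.pi * i / n_samples
            y = curve(drive * math.sin(theta))
            re += y * math.cos(k * theta)
            im += y * math.sin(k * theta)
        amplitudes.append(2.0 * math.hypot(re, im) / n_samples)
    return amplitudes

# Hypothetical transmission curves (coefficients are illustrative only).
def symmetrical(x):
    # Odd function: the response is limited equally in both directions.
    return x - 0.2 * x ** 3

def asymmetrical(x):
    # The x**2 term makes the two directions of displacement unequal.
    return x + 0.1 * x ** 2 - 0.2 * x ** 3

sym = harmonic_amplitudes(symmetrical, drive=1.0)
asym = harmonic_amplitudes(asymmetrical, drive=1.0)
for k in range(4):
    print("harmonic", k + 1, round(sym[k], 4), round(asym[k], 4))
```

For the symmetrical curve the second and fourth harmonics vanish to within rounding error, while the third has an amplitude of 0.05. Adding the quadratic term leaves the third harmonic unchanged and introduces a second harmonic of the same size, which follows from the identities sin²θ = (1 − cos 2θ)/2 and sin³θ = (3 sin θ − sin 3θ)/4.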

It should be pointed out here again that part of the variability
in the behavior of the even harmonics at low frequencies is due
to the admixture of action potentials among the cochlear potentials.
The action potentials tend to appear as even harmonics
when viewed through a wave-analyzer.

Fig. 81. [Effect of cutting the tensor] tympani of a cat upon the magnitude of
the second and third harmonics. The coordinates are the same as those in Fig. 80.
The dotted curves show the amount of harmonic present after cutting.
(Stevens and Newman, 3.)

THE TRANSMISSION CHARACTERISTIC OF 
THE EAR 

The behavior of aural harmonics suggests a definite hypothesis
of their origin. With sounds of small amplitude, the
response of the ear is essentially linear and symmetrical, so that
no harmonics occur below an intensity level of about 45 db. As
the intensity is increased and the auditory mechanism is made to
vibrate with larger amplitude, first one portion of the mechanism
and then another reaches a constraining limit beyond
which Hooke’s law breaks down. (Hooke’s law tells us that, in
a linear system, the resulting displacement is proportional to the
force applied.) When the displacement passes such a limit, the
function relating sound pressure to the response of the ear is
distorted from linearity. It is the graph of this function which
is referred to by the term characteristic curve, a term widely
used in describing the properties of vacuum tubes. Then the
position of the system on such a curve, when no force is applied,
is called the operating point, and under a sinusoidal force, the
system moves back and forth along the characteristic curve on
either side of this point of rest. An example of a characteristic
curve is shown in Fig. 4 (p. 14).

When the curve is symmetrical on both sides of the operating
point, nonlinearity leads exclusively to the production of
odd harmonics, and the even harmonics arise only when some
degree of asymmetry occurs. Consequently, the simplest hypothesis
regarding the behavior of the ear would hold that,
when a sinusoidal pressure is impressed on the ear and its amplitude
is gradually increased, the peaks of displacement in one
direction soon reach a limit beyond which the characteristic
curve becomes bent. At this point even harmonics appear. The
amount of even harmonics will increase, and at some greater
amplitude the peaks of displacement in the opposite direction
also arrive at a point where linearity stops. Here begins the
production of odd harmonics, for here the movement of the
ear encounters nonlinearity in both directions at once.

The differences we have noted between the odd and the
even harmonics, with respect both to magnitude and to variability,
can be accounted for in terms of this scheme. Particularly
important is the fact that this hypothesis enables us to explain
how a marked modification of the even harmonics might be
produced by a change of tension in the muscles of the middle
ear. If we conceive of the two limits of linearity as being relatively
fixed, it is clear that a pull from the muscles of the ear
might shift the operating point along the characteristic curve
and leave it in some new relation to the mid-point between the
limits. Then, as experiment has suggested, changes in the
tension of these muscles might increase or decrease the relative
amount of even harmonics.

These relationships can most easily be understood if we construct
an approximation to the over-all characteristic curve of
the ear (Fig. 82). We must plot the curve for the response of
the fundamental, as is shown in Fig. 78. However, we must
plot it in linear instead of logarithmic coordinates. Then the
long linear portion of the logarithmic curve becomes a small
straight segment very near the operating point, O, and the resulting
curve represents the upper half of the characteristic
curve, as is shown in Fig. 82. The lower half is simply an image
of the upper, and the two together give us the complete function
which relates sound pressure to the potential generated in the
ear. Now, if the ear were symmetrical, the operating point,
where the system would find itself in the absence of any sound,
would be at O. The mechanical structure of the ear and
possibly the tension of its muscles tend, however, to displace
the operating point to some asymmetrical position, such as position
A. Then, any sinusoidal force operating about point A
will produce a motion having both odd and even harmonic
components.
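The effect of displacing the operating point from O to some position A can be sketched with a simple sigmoid curve. This is an illustration under stated assumptions, not the curve of Fig. 82: the hyperbolic tangent, the drive amplitude of 0.5, and the shifts of 0.2 and 0.4 are all hypothetical choices in arbitrary units, and the function names are invented for the example.

```python
import math

def harmonic_amplitude(curve, drive, k, n_samples=512):
    """Amplitude of the k-th harmonic of curve(drive * sin(theta))."""
    re = im = 0.0
    for i in range(n_samples):
        theta = 2.0 * math.pi * i / n_samples
        y = curve(drive * math.sin(theta))
        re += y * math.cos(k * theta)
        im += y * math.sin(k * theta)
    return 2.0 * math.hypot(re, im) / n_samples

def shifted_curve(operating_point):
    """Sigmoid curve re-centred so that zero pressure gives zero output."""
    rest = math.tanh(operating_point)
    return lambda x: math.tanh(x + operating_point) - rest

second = {}
third = {}
for shift in (0.0, 0.2, 0.4):   # 0.0 corresponds to the symmetrical point O
    curve = shifted_curve(shift)
    second[shift] = harmonic_amplitude(curve, 0.5, 2)
    third[shift] = harmonic_amplitude(curve, 0.5, 3)
    print(shift, round(second[shift], 4), round(third[shift], 4))
```

With the operating point at the center of symmetry the second harmonic is absent, and it grows as the operating point is moved away from the center. The third harmonic is present at every setting, although in this toy curve its size is not independent of the shift.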

This working picture of the ear, if correct in principle, would
mean that for those instances in which the even harmonics decline
at high intensities (Figs. 78 and 79), under large sound-pressures,
the operating-point tends to move to a more symmetrical
position on the characteristic curve. This effect could be
achieved in either of two ways. Either the operating-point
could move to a new position on a fixed characteristic curve, or
else the curve itself could undergo alteration at large amplitudes.
The ear is a very complex structure and as yet we have no means
of telling precisely what determines the form of its characteristic
curve.

Fig. 82. Approximate characteristic curve of the ear. The curve relates
the electric potential of the cochlear microphonics (ordinate) to the applied
sound pressure in dynes per square centimeter. Thus when the pressure
varies sinusoidally about the zero value, the instantaneous voltage in the
cochlea may be read from the curve and the resulting wave form determined.
Factors such as tension on the muscles of the middle ear may shift the operating
point (point corresponding to zero pressure and zero voltage) from O to
some other point such as A. With the operating point at A, both odd and
even harmonics are present in the output. (Stevens and Newman, 3.)



Nevertheless, the effectiveness of the middle-ear muscles in
changing the proportion of even harmonics argues that the
curvature in Fig. 82 depicts a property of the middle ear. On the
other hand, the middle and inner ears constitute a closely
coupled mechanical system, so that it is not impossible to conceive
of the harmonics as originating in the inner ear itself
(Békésy, 17). Certain it is that persons whose eardrums and
ossicles are missing from the middle ear experience distortion
(Lewis and Reger), but this fact is to be expected, and it does
not argue that, in the normal case, the middle ear is not responsible
for whatever distortion we hear. Furthermore, in any
discussion of the locus of auditory distortion, we should remember
that the question is not whether any particular part of the system
is nonlinear (it must be at large enough amplitudes), but the
question is rather which part of the system is most nonlinear,
so that its limits of linearity are the first ones reached when the
amplitude is increased. It is this part of the system which will
determine the form of the characteristic curve, and it is quite
possible that different parts of the system perform this function
at different intensity levels.

Independent support for the notion that distortion might be
blamed upon the middle ear comes from a recent study of the
exact mechanical motion of the auditory ossicles (Stuhlman).
Beginning with a series of very careful measurements of the
dimensions and relative positions of the ossicles, Stuhlman constructed
a scale model of the middle ear, so that he could study
its rather complicated motions. Within the limitations set by
the model in simulating the correct suspension and articulation
of the ossicles, the experimental evidence showed that a simple
sinusoidal motion, impressed at the eardrum, is transmitted to
the oval window only after having undergone both asymmetrical
and nonlinear distortion. No distortion is introduced at
small amplitudes. From his observations, Stuhlman was able to
plot the characteristic curve for the ossicles (see Fig. 107, p. 258).
This curve, with its sigmoid form and its asymmetrical operating
point, is obviously similar to that in Fig. 82.



COMBINATION TONES

Simultaneous stimulation of the ear by two pure tones produces
an electric potential, out of which not only the several
harmonics of these two tones but also the sum- and difference-tones
representing combinations of these harmonics can be
analyzed (Newman, Stevens, and Davis). The presence of
these tones is a necessary consequence of a characteristic curve
like that in Fig. 82. The procedure here was identical with
that used to investigate the aural harmonics: the electrical output
of the cochlea of an animal was analyzed by means of a
wave-analyzer. This method shows clearly that two pure tones,
led simultaneously to the ear, produce in the cochlea all the
combination tones that can be easily detected by the method of
‘best beats’ and a great many more besides.



Fig. 83. [Difference tones in the cochlear] microphonics when the ear of a cat
is stimulated by two pure tones of 700 and 1200 cycles. Abscissa values
represent the intensity of the two stimuli in decibels above the average human
threshold. The ordinate scale is in decibels below the maximal average value
of the two fundamentals. (Newman, Stevens, and Davis.)

Typical behavior of the combination tones resulting from
two primary frequencies (700 and 1200 cycles) at various intensities
is shown in Figs. 83 and 84. These curves are essentially
representative of the usual course of the combination tones, but
any particular curve is subject to considerable variability in
different experiments. Figure 83 represents the difference
tones, and Fig. 84 the summation tones. In general, the difference
tones are larger than the summation tones of corresponding
order. (The order of a combination tone is here defined as
the number of the highest harmonic entering the combination.)
Thus, the first-order difference tone (500 = 1200 − 700) is
larger than the first-order summation tone (1900 = 1200 +
700), although both these tones appear at rather low levels of
the fundamental tones and follow functions very similar to
one another. The second-order difference tone (1700 = 2 × 1200 − 700)
consistently reaches the largest value of all combination
tones. Its complement (200 = 2 × 700 − 1200) is not
so large and tends to decline at high intensities. Among the
summation tones, one of them (3100 = 2 × 1200 + 700)
reaches the largest value, but does not greatly exceed its complement
(2600 = 2 × 700 + 1200).

Fig. 84. [Summation tones.] (Newman, Stevens, and Davis.)

Although the combination tones represented in Figs. 83 and
84 include the largest present in the ear, there are many others,
including some that surpass occasionally the smaller ones in
the figures. A thorough exploration with a wave-analyzer
(Newman, Stevens, and Davis) of the frequency range 100 to
8000 cycles yielded a grand total of 66 different tones present in
the cochlear response of a cat stimulated by 700 and 1200 cycles
at 90 db above threshold. If we let L represent the lower tone,
700 cycles, and U the upper tone, 1200 cycles, Table IV gives
the magnitude of the tones corresponding to various combinations
of U and L taken singly and together. It will be noted
that there are in this table as many as 3 tones of the seventh
order and 10 tones of the sixth order. The only tones below
6000 cycles involving combinations of the sixth order or less
which were not found were 600 = 4U − 6L or 6L − 3U,
1100 = 5L − 2U, 2500 = 5U − 5L, and 5800 = 6U − 2L.
The absence of these 4 combination tones may be due simply
to their failure to reach the arbitrary criterion of 0.1 per cent
of the fundamental.
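The arithmetic of the combination tones, and the definition of order as the number of the highest harmonic entering the combination, can be checked with a short enumeration. The sketch below is an editorial aid with hypothetical function and variable names. It lists the frequencies that combinations of U and L could produce; it says nothing about which tones were actually observed or about their magnitudes.

```python
def combination_tones(lower, upper, max_order, f_min, f_max):
    """Lowest order at which each frequency m*upper +/- n*lower can arise.

    The order of a combination is taken, as in the text, to be the
    number of the highest harmonic entering it, i.e. max(m, n).
    Pure harmonics (m or n equal to zero) are included.
    """
    lowest_order = {}
    for m in range(max_order + 1):
        for n in range(max_order + 1):
            if m == 0 and n == 0:
                continue
            order = max(m, n)
            for freq in (m * upper + n * lower, abs(m * upper - n * lower)):
                if f_min <= freq <= f_max:
                    if freq not in lowest_order or order < lowest_order[freq]:
                        lowest_order[freq] = order
    return lowest_order

L, U = 700, 1200
tones = combination_tones(L, U, max_order=6, f_min=100, f_max=8000)
for freq in (500, 1900, 1700, 200, 3100, 2600, 600, 1100, 2500, 5800):
    print(freq, "cycles: order", tones[freq])
```

The six tones discussed in connection with Figs. 83 and 84 come out as first- and second-order combinations, and the four tones reported as missing come out as fifth- or sixth-order combinations, in agreement with the expressions given above.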

The astonishing number of combination tones present in
the ear, when the external stimulus is a single pair of pure tones,
shows us how complex the spectrum of a sound becomes in the
process of transmission to the inner ear. Except at low sensation
levels, the spectrum in the inner ear is vastly more complex
than in the outer ear. Imagine, then, the degree of complexity,
within the cochlea, of an orchestral strain composed itself of
many component frequencies.

COMBINATION TONES FROM SUPER-AUDIBLE
FREQUENCIES

It has been reasoned that, since the nonlinearity of the ear
allows us to hear a difference tone when two tones are sounded,
we should be able to detect the presence of two super-audible
frequencies by reason of the difference tone which they would
produce. Thus, if tones of 25,000 and 26,000 cycles were presented
together, we might hear a 1000-cycle difference tone,
even though the two high frequencies are separately inaudible.
Such, however, is apparently not the case. Recent improvements
in high-frequency generating devices (Pierce) have made
it possible to produce super-audible sound waves at very high
intensities. Nevertheless, attempts to produce difference tones
in the ear by means of these high frequencies have failed completely.
Such a failure is not surprising in view of the fact that
the response of the ear is linear for small amplitudes of motion.
The over-all mechanical tuning of the ear is such that its motion
is very slight when being driven at a high frequency, much too
slight, apparently, for distortion to appear when available intensities
of super-audible tones are employed.

On the other hand, when a particular ear is suffering from
what we call high-tone deafness, tones which are separately
inaudible may produce a difference tone when presented together.
Thus, in an ear which was unresponsive to tones above
6000 cycles, it was found possible to produce an audible difference
tone of 1000 cycles by stimulating the ear with tones of
7000 and 8000 cycles. These stimulating tones were sufficiently
intense and low enough in frequency to force the ear beyond
its limits of linearity and thereby produce a difference tone.
The mere fact that the ear could not hear the two primary tones
had no effect on this process of distortion, for the high-tone
deafness was probably due, in this instance, to a deficiency in
the neural rather than in the mechanical mechanism of the ear.
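The dependence of the difference tone on the amplitude of motion can be illustrated with a toy calculation. The sketch below passes tones of 7000 and 8000 cycles through a hypothetical asymmetrical curve (a hyperbolic tangent with an assumed operating-point shift of 0.3) and measures the 1000-cycle component relative to the 7000-cycle component. The curve, the shift, and the drive amplitudes are assumptions in arbitrary units, not measurements on any ear.

```python
import math

def component_amplitude(samples, cycles):
    """Amplitude of the component completing `cycles` cycles in the window."""
    n = len(samples)
    re = sum(y * math.cos(2.0 * math.pi * cycles * i / n)
             for i, y in enumerate(samples))
    im = sum(y * math.sin(2.0 * math.pi * cycles * i / n)
             for i, y in enumerate(samples))
    return 2.0 * math.hypot(re, im) / n

def difference_tone_ratio(drive, offset=0.3, n_samples=512):
    """Size of the 1000-cycle difference tone relative to the 7000-cycle tone.

    The window is one millisecond long, so the 7000- and 8000-cycle
    primaries complete 7 and 8 cycles and their difference tone 1 cycle.
    """
    rest = math.tanh(offset)
    output = []
    for i in range(n_samples):
        t = i / n_samples                      # fraction of the window
        x = drive * (math.sin(2.0 * math.pi * 7 * t)
                     + math.sin(2.0 * math.pi * 8 * t))
        output.append(math.tanh(x + offset) - rest)
    return component_amplitude(output, 1) / component_amplitude(output, 7)

soft = difference_tone_ratio(0.01)   # motion confined to the linear region
loud = difference_tone_ratio(0.5)    # motion carried into the curved region
print(soft, loud)
```

When the motion is slight the difference tone is a negligible fraction of the primary, and it becomes a much larger fraction once the drive carries the system into the curved part of the characteristic. This is the sense in which weak motion at super-audible frequencies fails to produce a difference tone, while intense tones of 7000 and 8000 cycles succeed.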

THE THRESHOLD FOR DISTORTION 
In the electrical recording and transmission of speech and
music, it is the aim of the engineer to design equipment that
will not introduce audible distortion. The fact that the ear
itself generates harmonics and combination tones affects, however,
the amount of objective distortion which can be tolerated
in acoustical instruments. A simple method of measuring the
audibility of distortion in a tone is to determine the amount of
second harmonic which must be mixed with a pure tone in
order to produce a just noticeable effect. An experiment that
determined the sensation level at which a 740-cycle tone was just
detectable by its audible effect on a tone of 370 cycles gave the
results shown in Fig. 85 (Newman, Stevens, and Davis).



The graph shows that at low sensation levels the second
harmonic introduced distortion almost at its absolute threshold
value. Hence, if masking is defined as a raising of the threshold,
it is apparent that masking of the added harmonic is
negligible below a sensation level of 40 or 50 db. From 50 to
80 db, however, the amount of harmonic necessary for an audible
change increases rapidly, first in absolute magnitude, and
later in relative magnitude as well, but above 80 db the curves
for the two observers flatten out significantly.



Fig. 85. The sensation level of the second harmonic, generated externally
to the ear, which is just detectable when mixed with the fundamental (370
cycles) at various sensation levels. Each curve is for a single human observer.
(Newman, Stevens, and Davis.)

The qualitative character of the audible change produced
by adding this harmonic was different at the various sensation
levels of the fundamental. At low levels the harmonic was
usually heard as a separate tone. In the middle region it was
heard as a sharpening or brightening of the timbre of the tone,
whereas at high levels the changes were so complex and so
dependent upon differences of phase that any generalization
about their character would be misleading.

The relation of this threshold for distortion to the aurally
generated harmonics is reasonably clear. Masking of the externally
generated harmonic arises at about the level at which
the aural harmonic begins first to appear. At somewhat higher
levels, it is plain that various forms of interference between
the two types of harmonics are responsible for a wide variety of
subjective effects. It is also in this region that phase relations
between the fundamental and the externally generated harmonic
become important. Finally, the flattening-off at the tops
of the curves is reminiscent of the form of the functions describing
the aural harmonics (Fig. 78). The course of the threshold
for distortion and its parallel to the aural harmonic thus suggest
that the threshold for distortion stops increasing at the point
where the aural harmonic ceases to grow.

THE EFFECT OF PHASE RELATIONS AMONG
HARMONICS

Not only does the phase of a harmonic that is present in the
stimulus have an effect upon the threshold for distortion, but
it may also influence the subjective effects of a complex tone.
This statement is contrary to the usual assertion that, under
Ohm’s auditory law, the ear tends to analyze the components
of a complex sound regardless of their phase relations. Those
experiments in which an auxiliary tone was made to beat with
an aural harmonic prove definitely that the phase relations
among the harmonic components of a stimulus are detectable,
for otherwise these beats could not occur. A harmonic in the
stimulus may reinforce or cancel an aural harmonic. Under the
method of ‘best beats’ we perceive an alternate reinforcement
and cancellation due to a constantly changing phase between
the auxiliary tone and the aural harmonic. When, however,
the auxiliary tone is identical in frequency with the aural harmonic,
no beats are heard, but it can be shown that the auditory
experience which occurs nevertheless depends upon the phase-relation
between the auxiliary tone and the aural harmonic.

There is one phase relation between these two tones which
gives a definite increase in loudness, both of the harmonic and
of the total experience; there is another which decreases the
loudness. In other words, a given tone, plus another tone of
exactly twice the frequency, may sound either louder or less
loud than the fundamental alone. Furthermore, the phase
yielding maximal loudness differs from that giving minimal
loudness by 180°.

The effects of adding various amounts of second harmonic in
three different phase relations are shown in Fig. 86. To a tone
of 108 cycles, at an intensity-level of [illegible] db, was added a
216-cycle tone in such phase-relations as to give (1) maximal loudness,
(2) minimal loudness, and (3) intermediate loudness.
Curve A shows how the experienced loudness changes when
the added harmonic is set to give minimal loudness. In this
case, the 216-cycle tone is opposite in phase to the second aural
harmonic and cancellation occurs. Consequently, as the intensity
of this added harmonic is increased, the loudness declines,
until finally the second aural harmonic is completely canceled.
Thereafter, the loudness increases with added intensity of the
harmonic. Presumably, then, where curve A reaches a minimum,
the added harmonic is equal in magnitude and opposite
in phase to the aural harmonic, and the occurrence of this
minimum at about 87 db is consistent with the curves of Fig. 76.
Curve C represents the situation where the added harmonic is
180° out of phase with the harmonic used to obtain curve A,
and is presumably in phase with the aural harmonic. Hence,
loudness always increases when the intensity of this added harmonic
is raised. Curve B shows how an intermediate phase
produces an intermediate effect.

Fig. 86. Showing how the total loudness changes when a 216-cycle tone
(second harmonic) is added to a 108-cycle tone in three different phase-relations.
The change in loudness is measured in terms of just perceptible
steps (DL’s). Curve A is for the phase-relation giving minimal increase in
loudness, curve B is for intermediate, and curve C is for maximal increase in
loudness. The tone represented by curve A is 180° out of phase with the
tone represented by curve C. (After Chapin and Firestone.)
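The behavior of curves A, B, and C follows the ordinary rule for adding two sinusoids of the same frequency. The sketch below is a minimal illustration in arbitrary units: it computes the resultant amplitude, which is not the same thing as loudness in DL's, and the size assigned to the aural harmonic is hypothetical.

```python
import math

def resultant_amplitude(aural, added, phase_deg):
    """Amplitude of the sum of two sinusoids of equal frequency.

    `phase_deg` is the phase of the added tone relative to the aural
    harmonic: 0 means in phase, 180 means opposite in phase.
    """
    phi = math.radians(phase_deg)
    return math.hypot(aural + added * math.cos(phi), added * math.sin(phi))

aural = 1.0   # hypothetical size of the second aural harmonic
for added in (0.0, 0.5, 1.0, 1.5, 2.0):
    row = [round(resultant_amplitude(aural, added, phase), 3)
           for phase in (180, 90, 0)]   # analogues of curves A, B, and C
    print(added, row)
```

In opposite phase the resultant falls to zero when the added harmonic equals the aural harmonic and rises thereafter, as with curve A. In phase it rises steadily, as with curve C, and the intermediate phase gives intermediate values.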



Fig. 87. Showing how intense an added tone, equal to the difference
between two other tones, must be in order to produce a detectable increase
in the loudness of the aural difference tone. The phase of the added tone is
shown by the abscissa. At certain phases, the added tone is below the threshold
of audibility for that frequency, as shown by its negative sensation level.
Plots A and B are for two different observers. (After Lewis and Larsen.)

Not only does a change of phase alter the loudness of a harmonic,
and of the total experience, but it produces noticeable
differences in quality, provided the fundamental is a low tone
of 100 cycles (Trimmer and Firestone). The phase relation
giving minimal loudness is characterized by smoothness, whereas
the opposite phase, which leads to maximal loudness, carries
with it a rough or dissonant element. The phase giving
minimal roughness was found to be almost the same as that
giving minimal loudness, although considerable variability occurred
among different observers.

Just as the aural harmonics can be interfered with by a tone
of the proper frequency and phase, so also can we investigate
combination tones by adding to the stimulus frequencies equal
to the tones generated in the ear. Lewis and Larsen worked
with a difference tone of 130 cycles created by the two frequencies,
390 and 520 cycles, at an intensity level of 70 db. They
measured the intensity of a 130-cycle tone which, when added
to the combination in various phase relations, was just sufficient
to make the difference tone sound noticeably louder. The results
for two observers are shown in Fig. 87, where we see that
much less energy is needed at certain phases than at others. On
the assumption that, in order to make the difference tone just
noticeably louder, the least energy is required when the added
tone is exactly in phase with the difference tone, we may determine
this phase relation from the lowest points on the curves
of Fig. 87. Apparently the difference tone has a phase of 200°
in the ear of observer A and of 320° in the ear of observer B.

THE STABILITY OF THE CHARACTERISTIC 
CURVE 

Now, we should like, of course, to be able to explain all the
results of the experiments considered in this chapter in terms
of the approximate characteristic curve drawn in Fig. 82. The
problem would be greatly simplified if we could assume that
this curve portrays the characteristics of a typical ear under all
conditions of stimulation. However, before we can feel secure
in this assumption, certain difficulties must be explained. Our
limited experimental evidence indicates that the relative phases
of the aural harmonics and combination tones are disconcertingly
different from ear to ear. Likewise, the magnitudes of
these tones show considerable variation among different
observers. Why distortion in different ears should exhibit such
lack of uniformity is not clear at present.





Another complication, waiting to be explained, is the fact
that the size of the aural harmonics, as measured directly in
the ear of a cat, is greatly altered by the presence of another tone
(Newman, Stevens, and Davis). Thus, the third aural harmonic
of a 700-cycle stimulus was found to be reduced by about
20 db when a 1200-cycle tone was sounded simultaneously.
Not only were the aural harmonics reduced, but they changed
size very irregularly as a function of intensity. A fixed
characteristic curve would not by itself account for this effect. The
solution of these difficulties waits for further experimental
evidence.



CHAPTER 8 


AUDITORY MASKING, FATIGUE,
AND PERSISTENCE

There are, in general, two conditions under which a normally
effective auditory stimulus may fail to arouse a sensation. One
is when it is accompanied by another sound which obliterates or
masks it; the other is when it is preceded by a sound which
leaves the organism unresponsive or fatigued.

MASKING 

One tone is said to produce masking when it raises the
threshold of a second tone. We have already seen how it is possible
to use data from experiments on masking to calculate the
loudness of certain sounds (Chapter 4). Here we shall examine
other problems connected with auditory masking.

The measure of masking is the number of decibels that the
threshold of the masked sound is elevated in consequence of the
presence of the masking sound. The simplest method of
measuring masking is to turn on the masking sound and then
gradually to increase the intensity of the masked sound until its
presence is just detectable. The difference between this value
of the intensity of the masked sound and its threshold
intensity when no masking sound is present is the masking value.
The temporal relations in the presentation of the two sounds
and the number of observations made with them are, of course,
important variables in an experiment on masking (Wever and
Truman).
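
The definition just given amounts to a subtraction in decibels. The
following is a minimal sketch only; the function name and the two
threshold values are hypothetical and are not taken from any of the
experiments cited.

```python
def masking_db(masked_threshold_db: float, quiet_threshold_db: float) -> float:
    """Masking in decibels: the elevation of the threshold of the masked
    sound caused by the presence of the masking sound."""
    return masked_threshold_db - quiet_threshold_db


# Illustrative (made-up) levels: the tone is just detectable at 12 db in
# quiet and at 47 db when the masking sound is turned on.
example = masking_db(masked_threshold_db=47.0, quiet_threshold_db=12.0)
print(example)  # 35.0
```

A negative result would correspond to the reinforcement, or negative
masking, that can arise when two faint tones beat.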

Wegel and Lane made an extensive study of the masking
effects of pure tones and found that these effects vary greatly
with the frequency and intensitive relations of the tones.
Figure 88 shows the amount by which tones of various frequencies
are masked by the presence of another tone at different sensation
levels. Each plot is for a masking tone of different frequency
Fp, as indicated, whose sensation level is shown by the abscissa.
The ordinates give the change in threshold (the masking)
suffered by the frequencies indicated by the separate curves.

Fig. 88. Curves showing the ability of different frequencies Fp (indicated
on each plot) to mask other frequencies (indicated by the numbers attached
to the curves). The sensation level of the masking tone is shown by the
abscissa, and the elevation in the threshold of the masked tone by the
ordinate. (After Wegel and Lane.)

From these plots, certain general facts are evident. A tone of
a frequency much below the masking tone is not perceptibly
masked when the intensity of the masking tone is low, and even
when the masking tone is loud the effect on a tone of lower
frequency is but slight. A tone of much higher frequency
than the masking tone is not perceptibly masked by a weak
masking tone, but, as the intensity of the masking tone is
increased beyond a certain value, the degree of masking grows
rapidly. In other words, a loud tone masks tones higher than
itself more easily than it masks lower tones. In general,
masking is greater when the tones lie close together in frequency.
In this event, the curves tend to approach lines with 45° slopes
which intercept the axis of the abscissae at a sensation level of
the masking tone equal to about 20 db.
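
The limiting behavior described for neighboring frequencies can be
written as a straight line of unit slope. The sketch below is only an
idealization of that asymptote: the function name is hypothetical, the
20 db intercept is taken from the sentence above, and the assumption of
zero masking below the intercept is ours rather than a fit to the data
of Wegel and Lane.

```python
def masking_near_masker_db(masker_sensation_level_db: float,
                           intercept_db: float = 20.0) -> float:
    """Idealized asymptote: a 45-degree line (1 db of masking per db of
    masker level) that meets the abscissa at about 20 db. Below the
    intercept no masking is assumed."""
    return max(0.0, masker_sensation_level_db - intercept_db)


for level in (10, 20, 60, 80):
    print(level, masking_near_masker_db(level))
```

On this idealization a masking tone at 80 db above threshold would
raise the threshold of a neighboring tone by about 60 db.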

When the tones are close enough together in frequency to
beat, they do not give rise to masking in the same sense as when
farther apart. Measurements of masking then represent the
minimal perceptual fluctuation of the beating tone. In the
special instance when two such tones are so faint as to be
separately just inaudible, they will, when introduced into the ear
together, beat in such a way as to be alternately audible and
inaudible. Hence, in this case, we actually obtain a
reinforcement, i.e., a negative value of masking.

The sudden increase in the slopes of those curves in Fig. 88,
for which the masked frequency is higher than the masking
frequency, is associated with the appearance of aural harmonics.
This effect is illustrated in Fig. 89, which represents the masking
due to a 1200-cycle tone at a sensation level of 80 db. The solid
curve resembles such a curve as we might expect if three
masking frequencies, 1200, 2400, and 3600 cycles, were present, with
relative magnitudes of 40:4:1. These frequencies were not,
however, present in the stimulus; they were introduced by
distortion due to the nonlinear transmission of the ear. Wegel
and Lane determined the magnitude of these harmonics by
measuring the intensity at which another tone, differing a few
cycles from the harmonic, produced the most prominent beats
when sounded simultaneously.
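
For comparison with the decibel scales used elsewhere in this chapter,
the ratio of the three magnitudes can be converted to level
differences. The sketch assumes that the stated magnitudes are
amplitude (pressure) ratios, so that 20 log10 applies; if they were
energy ratios the factor would be 10 instead. The function name is
hypothetical.

```python
import math


def relative_levels_db(magnitudes, amplitude=True):
    """Level of each component in db relative to the first component."""
    factor = 20.0 if amplitude else 10.0
    reference = magnitudes[0]
    return [factor * math.log10(m / reference) for m in magnitudes]


# Fundamental, second harmonic, third harmonic in the ratio 40:4:1.
levels = relative_levels_db([40, 4, 1])
print([round(x, 1) for x in levels])  # [0.0, -20.0, -32.0]
```

Under the amplitude assumption the second and third aural harmonics
lie about 20 and 32 db below the fundamental.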

The character of the sensation caused by two tones, acting
simultaneously on the ear, varies considerably with the relative
frequency and intensity of the tones, because the same
nonlinearity which introduces harmonics at high intensities gives
rise to combination tones when both tones are sufficiently loud.
Thus Fig. 89 represents the sensations obtained when a
1200-cycle tone, at 80 db above threshold, is combined with different
secondary frequencies whose sensation levels are indicated on
the ordinate of the graph. The various areas represent ranges
of frequency and intensity of the secondary tone in which
combination tones of various kinds, as indicated, appear. When
any secondary tone of a frequency below about 1000 cycles is
raised in intensity from a subaudible value to a point at which it
is just detectable, it is first heard as a separate tone along with
the primary tone. In the lower part of this frequency range,
the intensity of the secondary tone may be increased to very
large values and the tone still will be perceived independently
of the primary. When, however, the intensity of the secondary
tone is increased to a point indicated by the dotted line, the
difference tone is distinguishable and increases gradually in
relative intensity as the area above this line is crossed. At very
high intensities, in this region, a complex mixture of tones is
heard. On the other hand, when a higher secondary tone, 1900
cycles for example, is introduced in the same way, its presence
is first detected by the appearance of a difference tone, and the
secondary itself is not heard. Then, as the intensity is further
increased, the secondary tone becomes audible along with the
difference tone, and, with still further increase in intensity, the
mixture of tones becomes more and more complex. All these
effects are what might readily be predicted from a study of
the facts presented in the previous chapter.

Fig. 89. The various sensations produced by two pure tones, one (the
primary) at 1200 cycles and 80 db above threshold, the other at a frequency
and sensation level as shown by the coordinates. (After Wegel and Lane.)

A special interest attaches to the masking effects of very low
tones, because of the persistent uncertainty that faces us
regarding the mechanism of pitch perception in this region. Here,
however, our interest is more abundant than our data. Bekesy
(22) presents us with results obtained by measuring the
elevation in the threshold of a wide range of tones listened to in the
presence of a 10-cycle tone at two sensation levels, 20 and 40
db. This very low frequency, at 40 db above threshold, masks
a major part of the audible frequency range, as shown in Fig.
90. Even at 20 db above threshold its effects are rather
widespread. One other record of masking by a low frequency, 75
cycles, comes to us from Fletcher (2). His is the middle
curve of Fig. 90. The dotted portions are the regions in which
beats appear, and the circles above the curve show the
magnitudes of the several harmonics of the 75-cycle tone. The
general course of the curves for 10 and for 75 cycles is clearly
similar, although the masking due to 75 cycles shows some
evidence of decreasing for frequencies lower than this tone,
an important fact, if finally established.

Fig. 90. The masking produced by low tones. The masking produced by
the 10-cycle tone at 20 and 40 db above threshold was measured by Bekesy (22).
The vertical line shows the sensation level of a 75-cycle tone employed by
Fletcher (2) to mask other frequencies, as indicated along the abscissa.

THE MASKING OF SOUND IMPULSES 

It is manifestly impossible to mask a steady tone by means
of a short impulse of sound, for the tone would be heard before
and after the impulse. The reverse experiment can be carried
out, however, and Bekesy (15) has obtained the pair of curves
in Fig. 91 showing how loud a tone must be in order to mask
a click whose loudness level is 40 db. Each of these curves is
for a single observer. As we might well expect, the tone does
not completely suppress the click unless its loudness level is
considerably above that of the click, 30 db at the least. Tones
below about 700 cycles are distinctly more effective in masking
impulses than are tones of higher frequency. Although this
last conclusion is supported by Bekesy's results, we may be
critical of its generality, for we have already seen how varied
can be the effects of impulses of different wave forms (see p.
157). Perhaps Fig. 91 would look different for other types of
short sounds.

Fig. 91. The intensity level at which various tones are just able to mask
a click whose loudness level is 40 db. (After Bekesy, 15.)






MASKING WITH TONES IN OPPOSITE EARS 

Crucial to any quest for the origin of the masking effect is
a knowledge of what happens when we put the masking tone
in one ear and the tone to be masked in the other. Figure 92
supplies this information. A tone of 1200 cycles was used to
mask other tones whose frequencies are indicated on the plots.
When both tones were led to the same ear, the masking was as
shown by the dotted lines, but, when the two tones stimulated
opposite ears, the masking fell to the values indicated by the
solid curves. The solid and the dotted curves are generally
similar in form, but the solid curve is displaced 40 to 60 db to
the right along the horizontal axis. In short, the masking
tone must be raised 40 to 60 db in intensity before the masking
in opposite ears equals the masking in one ear alone. The
curves of Fig. 92 may be explained if we assume that there are
two kinds of masking, central and peripheral (Wegel and
Lane). Central masking is relatively small, but is probably
always present. By contrast, peripheral masking is relatively
large, but is present only when the two tones stimulate
overlapping areas on the basilar membrane.

Fig. 92. Showing to what extent a tone of 1200 cycles at various intensities
is able to mask a tone in the opposite ear (solid curves). The dotted curves
show the masking when both tones are led to the same ear. The abscissa is
the intensity of the masking tone in db. (After Wegel and Lane.)

A larger amount of masking occurs, even when the tones
are in opposite ears, whenever the intensity of the masking tone
is great enough. Note how sharply the curves bend upwards
in the region of 60 to 80 db. This behavior is understandable
on the assumption that stimulation at high levels causes some
sound to be conducted through the head to the opposite ear,
where it establishes peripheral masking. In keeping with this
notion, we must assume, then, that the attenuation through the
head from one ear to the other is of the same order of magnitude
as the horizontal displacement between the dotted and the
solid curves of Fig. 92 (see Chapter 11).

There is still further evidence that when a tone is introduced
into one ear by a telephone receiver, the opposite ear is excited
to some lesser degree. People very deaf in one ear are able to
hear with the receiver on the deaf ear, provided the intensity is
raised 40 to 60 db above that required with the receiver on the
normal ear. Hearing, under these conditions, is improved by
plugging the good ear with the finger, a simple test, which
shows that the sound reaching the good ear gets there by bone
conduction.

RELATION BETWEEN MASKING AND EXCITATION 

The curve, or audiogram, depicting the course of the
auditory threshold in the presence of a masking sound, may be
interpreted as a picture of the pattern of excitation within the
cochlea, for which the masking sound is responsible. Thus,
in Fig. 89, where we find that the presence of a 1200-cycle tone
at 80 db above threshold raises the threshold for a 2000-cycle
tone by about 40 db, we have evidence that the 1200-cycle tone
does something to affect the sensitivity of that part of the basilar
membrane which normally responds to a frequency of 2000
cycles. If a single tone were able to confine its effects to a very
restricted area of the membrane, it would mask other tones only
slightly (and this masking would be central in origin), but
owing to the large amount of damping in the ear, a single
frequency stimulates a wide area. To this spread there is added,
at great intensities, the effects of the aural harmonics, with the
result that a loud tone may make its presence felt, to some
extent, throughout the entire cochlea.

As an example of what the patterns of excitation in the
cochlea are like in the event of stimulation by a single tone of
1000 cycles, at several intensities, consider Fig. 93. These curves
were adapted from data on the masking effects of a 1000-cycle
tone (Fletcher, 2). The humps in the upper curves represent
the aural harmonics, and they appear at distances along the
abscissa corresponding to their proper locus within the cochlea, as
outlined in Chapters 3 and 15.

Fig. 93. The patterns of stimulation on the basilar membrane due to a tone
of 1000 cycles as determined from data on masking. The parameter here is
the intensity level of the 1000-cycle tone. The abscissa is the distance
from the helicotrema.

We have said that the curves of Fig. 93 correspond to the
actual patterns of excitation, or stimulation, on the basilar
membrane. This statement is true enough, provided we define
excitation in terms of masking. Such a procedure is safe, but its
safety does not compensate for the fact that it still leaves us vague
as to just what features of the activity of the auditory mechanism
will determine whether or not a tone will be masked. There
are alternative possibilities. Wegel and Lane suggest that, in
order that a tone be heard above a masking tone, its amplitude
of motion at its proper locus on the basilar membrane must
equal the amplitude already existing at that locus as a result of
the masking tone. Then the curves of Fig. 93 would represent
the relative amplitudes of vibration of the basilar membrane
at different positions. Before proceeding with this
interpretation, it is important to note that the ordinate of Fig. 93 is
logarithmic. Changed to a linear ordinate, the peaks of the
curves would be much more salient.
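
To see why a linear ordinate would sharpen the peaks, we may convert a
few decibel values to relative amplitudes. The levels below are
illustrative only and are not read from Fig. 93; the conversion assumes
that the ordinate expresses amplitude in decibels (20 log10), and the
function name is hypothetical.

```python
def db_to_relative_amplitude(level_db: float) -> float:
    """Convert a level in db to a relative amplitude (0 db -> 1.0)."""
    return 10.0 ** (level_db / 20.0)


# A peak at 60 db beside a shoulder at 40 db: the 20 db difference on the
# logarithmic ordinate becomes a tenfold difference on a linear one.
peak = db_to_relative_amplitude(60.0)
shoulder = db_to_relative_amplitude(40.0)
print(peak / shoulder)
```

Two points that look moderately separated on the logarithmic plot thus
differ by an order of magnitude in amplitude.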

Another possibility is that masking bears no simple
relation to the mechanical motion of the basilar membrane, but is a
function of the level of excitation in the fibers of the auditory
nerve. In this case, the parts of the curves of Fig. 93 lying
between the harmonic peaks would represent the magnitude
of certain neural events. Although the quantitative relations
between amplitude and nerve potentials are yet to be explored,
the characteristics of the nervous mechanisms involved in
hearing certainly account for some types of masking (see p. 409).

FATIGUE 

Auditory fatigue is an interesting phenomenon, more
because of its absence than because of its presence. It is surprising
that the ear, assailed as it is both day and night by sounds and
noises of all sorts, suffers so little decrement in acuity. No flap
or lid enables us to protect our ears from unwanted disturbances,
and we must even leave them open when we sleep. Happily we
learn to disregard the great bulk of the sounds we hear, at the
same time preserving a selective attention for what we consider
significant. Even in sleep we may learn to disregard the roar
of traffic from the street, but waken at the faint sound of
someone stirring in the room. Not that excessive unwanted noise is
without ill effects upon the organism, but, as far as the ear
itself is concerned, the din of modern life leaves us little the
worse for it.

It is true that extreme sound pressures may shatter the
auditory mechanism by producing actual lesions that lead to
permanent loss of hearing (see Chapter 15), but this does not
constitute fatigue. By fatigue we mean a temporary loss of
auditory sensitivity due to previous auditory stimulation.
Obviously then, the straightforward method of measuring fatigue
is to determine a subject's sensitivity before and after he has
been stimulated by the fatiguing sound. Three of the most
common measures of fatigue have been (1) the change in the
absolute threshold of a sound, (2) the change in the apparent
loudness of a sound, and (3) the change in the apparent position
of a binaurally localized source of sound (cf. Banister for
recent review).

A variety of studies, using these methods, have
demonstrated auditory fatigue. But instead of disclosing a precise
and easily measurable phenomenon, these studies show auditory
fatigue to be elusive and variable, and the reader finds a
considerable lack of agreement in the sporadic literature on the
subject. There are no charts showing quantitatively the amount
and duration of losses, due to previous stimulation, for a wide
range of frequencies and intensities. We must content
ourselves with a few generalizations to which the exceptions are
all too numerous.

Previous stimulation causes an elevation of the auditory
threshold, but the effect is generally short-lived, a matter of a
few seconds to a few minutes. Whether the fatigue will last
seconds or minutes has been said to depend on the frequency
of the stimulating tone (Bronstein and Chunlova). Thus the
time necessary for the threshold to return to normal after the
ear had been exposed for 2 min. to tones at a loudness level of
94 db increased from 20 sec. to 6 min. as the frequency of the
fatiguing tone was raised from 100 to 4000 cycles. But differences
among individuals are large as regards the duration of fatigue.

The greatest amount of fatigue occurs at the frequency of
the stimulating tone, but for low tones the effects of fatigue
(when any) may spread to frequencies fairly far removed.
Previous stimulation also causes a loss in the loudness of subsequent
tones, and here again there is evidence that the loss is greatest
at or near the fatiguing frequency. This type of evidence, so
far as it goes, suggests that the pattern of fatigue on the basilar
membrane presents the same sort of picture as that disclosed
by the masking curves of Fig. 93.

An interesting consequence of such a pattern of fatigue is
reported by Bekesy (3), who found that, as a result of exposure
to an 800-cycle tone, the pitch of tones slightly removed from
800 cycles was raised or lowered, depending upon whether the
tone was above or below 800 cycles in frequency. This change
in pitch was greatest at 500 and at 1200 cycles, and amounted to
about 7 per cent. Bekesy's explanation is to the effect that the
1200-cycle tone finds the pattern of fatigue along the basilar
membrane to be greater just below than just above the region
which resonates to 1200 cycles. Hence, the pattern of excitation
due to the 1200-cycle tone becomes skewed in such a way that its
maximum is shifted along the membrane toward the oval
window, and the result is an elevation of the perceived pitch.
An analogous, but opposite, effect occurs at 500 cycles.

A novel aspect of the problem of auditory fatigue comes to
light in the work of Rawdon-Smith (2) and appears to explain
why so many careful and well-planned experiments have led
to contradictions. In the first place, he found that, when the
fatiguing stimulus falls on one ear, the opposite ear suffers a
decrement in sensitivity, a decrement which is not limited
to the fatiguing frequency alone. Hence, some of the effects
which we class as auditory fatigue appear to originate centrally.
Now, when the factors causing a change in an organism's
normal response to a stimulus are conditioned upon the state
of the central nervous system, we are not surprised when we
find increased variability. Central inhibition is a labile
phenomenon. Consequently, both the variability and the binaural
nature of auditory fatigue can be accounted for, if we assume
that the loss in sensitivity is due to the intervention of cortical
factors. The phenomenon would partake, then, less of the
nature of sensory fatigue than of the nature of inhibition, and
the well-known phenomenon of disinhibition (Pavlov's 'inhibition
of inhibition') would be likely to appear. Rawdon-Smith
looked for this effect and found it by giving the observer an
unexpected, but innocuous, stimulus, such as momentary
darkness in the soundproof room. His threshold, which had been
tested immediately before the unexpected stimulus, was retested
at once, and was found to have moved toward the normal
unfatigued level. Thereafter, the sensitivity declined again,
but could be restored by repeating the unexpected stimulus.

The phenomenon of auditory fatigue appears, then, to be
complicated by some type of central inhibition, which makes
it hard to discover, by psychophysical experiment, the actual
loss of sensitivity within the sense organ due to previous
stimulation. Perhaps we had best turn, for this information, to the
more direct observation of the behavior of the ear, as it is
revealed in the electrical output of the cochlea (see Chapters 13
and 14).

SENSITIZATION DUE TO STIMULATION

Before leaving the problem of auditory fatigue, we should
consider an apparently opposite effect which arises under
conditions which would be expected to produce fatigue. A series
of studies by Bronstein shows that after exposure to a loud tone
the auditory threshold not only returns to normal, but also, often,
it falls below normal for a period of time. This increased
sensitivity, which may amount to 10 or 15 db, extends to frequencies
other than that of the stimulating tone. As with fatigue, there
is some sensitization of the opposite ear, when only one ear has
been stimulated. This, and other features of the phenomenon,
suggest that sensitization and fatigue are both due, in part, to
cortical factors.

THE PERSISTENCE OF SENSATION 

Closely allied to the problem of the after-effects of sound
stimulation is the problem of the duration of the sensation itself.
We are all aware that the sensation stops abruptly when a tone
is turned off, and in audition, unlike vision, we experience no
marked after-images. So quickly, in fact, does the effect of an
auditory stimulus die out that the measurement of its rate of
decay is a difficult undertaking.

The problem has interested many investigators, including
Helmholtz, but, unfortunately, much skillful experimenting has
been squandered on an effort to determine auditory persistence
by a kind of 'flicker' technique. Experimenters have set out
to discover how rapidly a tone must be turned on and off before
the interruptions are too brief to be noticed, so that fusion
occurs. The success of the flicker method in vision is beyond
question, but in audition it is a failure. The reason lies in the
fact that it is impossible, in a sense, to turn a tone on and off,
for when we try to do so we merely introduce additional
frequencies into its acoustic spectrum and obtain a more complex
sound. (The same result is true, actually, of a visual stimulus,
but, as we saw on p. 94, the effects of modulating a light are
negligible, compared with the sensitivity of the eye.) The next
chapter treats the problem of modulation, and we shall see
there that the turning on and off of a tone is essentially a case
of amplitude modulation. Consequently, most of the
experiments designed to measure the persistence of auditory sensations
can best be classified as experiments on modulation, and the
results can be explained in terms of the effects of modulation on
the frequency spectrum of a sound.

Are there, then, any experiments which can be said to
measure persistence? It is quite likely that certain observations by
Bekesy (14) can be so classified. He measured the slowest rate
of decay of a tone which would give the sensation of a tone
ending abruptly. He allowed a tone to die out exponentially
over a period of time that was long enough to be clearly
detectable, and he then proceeded to shorten the time of decay
until further shortening gave no noticeable difference. In other
words, a rate of decay can be found such that a faster rate makes
no difference in the sensation (no difference, that is to say, until
the tone ends so abruptly that a click is heard). Presumably,
this critical rate of decay is just equal to the rate of decay of the
sensation itself, and any more rapid decay in the stimulus is
obscured by the fixed rate of decay of the sensation.




Figure 94 gives the values of the critical rates of decay for
an 800-cycle tone as a function of sensation level. The ordinate
gives the time necessary for the tone to die out to one thousandth
of its initial value, i.e., to decline through 60 db.

Fig. 94. The decay time of an 800-cycle tone which is extinguished at the
rate of physiological decay. The abscissa gives the initial sensation level
of the 800-cycle tone. (After Bekesy, 14.)

Now, since Bekesy has given us the rate of decay for these
tones, and also their initial intensities in decibels above threshold,
it is a simple matter to calculate the time which will elapse
before they decline to the auditory threshold. From Fig. 94
it is clear that the critical rate of decay for the louder tones is
greater than for the weaker ones. In fact, if we plot the decay
of each tone, as a function of time, we obtain the results shown
in Fig. 95. Here it is apparent that, regardless of the initial
intensity, all the tones reach the value of the auditory threshold
in approximately the same length of time, namely, 0.14 sec.
Consequently, if we accept the argument that these rates of decay
are equal to the rate of decay of auditory sensation, we are
forced to conclude that, regardless of the intensity of
stimulation, a sensation will take very nearly 0.14 sec. to die out after
stimulation ceases.
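
The calculation referred to above can be sketched as follows. A tone
that dies out exponentially falls a constant number of decibels per
second, so the time to reach threshold is the decay time through 60 db
scaled by the ratio of the initial sensation level to 60 db. The
function names are hypothetical, and the decay times printed are those
implied by a constant persistence of 0.14 sec.; they are not readings
from Fig. 94.

```python
def time_to_threshold_s(sensation_level_db: float,
                        decay_time_60db_s: float) -> float:
    """Time for a tone decaying at a constant number of db per second to
    fall from its initial sensation level to threshold (0 db)."""
    rate_db_per_s = 60.0 / decay_time_60db_s
    return sensation_level_db / rate_db_per_s


def critical_decay_time_60db_s(sensation_level_db: float,
                               persistence_s: float = 0.14) -> float:
    """Decay time through 60 db that brings the tone to threshold in the
    fixed persistence time (0.14 sec. assumed, as in the text)."""
    return 60.0 * persistence_s / sensation_level_db


for level in (20, 40, 60, 80):
    t60 = critical_decay_time_60db_s(level)
    print(level, round(t60, 3), round(time_to_threshold_s(level, t60), 3))
```

The louder the tone, the shorter the implied decay time through 60 db,
which is to say the greater the critical rate of decay, in agreement
with the statement above.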

Is such a conclusion reasonable? Not if the persistence of
sensation is due to continued mechanical vibration of the ear,
nor if it is due to the accumulation of some excitatory neural
substance whose concentration depends on the intensity of
stimulation, for in both these cases persistence would be longer when
the tones are louder. However, if we assume that the central
neural elements responsible for auditory sensation behave in a
strictly independent fashion, so that added intensity of
stimulation serves merely to increase the number of elements excited,
then the effect due to each element would die out independently,
and we should expect to find that intensity makes no difference
to persistence. If such is true, we may consider that the time
0.14 sec. represents approximately the time of decay of the
excitation of a single one of these central elements, and also the
time of decay of the total auditory sensation.

It is necessary to assign this phenomenon of persistence to a
central mechanism, because no persistence as great as 0.14 sec.
has been observed in the physiological effects detectable in the
cochlea or in the auditory nerve. All that we can really conclude,
provided Bekesy's relations can be shown to hold at other
frequencies, is that there is not, in audition, the type of
summation that builds up an excitatory substance, having a constant
rate of decay, in such a manner that higher concentrations occur
at higher intensities of stimulation. The exact mechanism
underlying a phenomenon of persistence having the characteristics
shown in Fig. 95 remains to be determined.

Fig. 95. The decay curves for an 800-cycle tone beginning at various
sensation levels (parameter) and declining to threshold. The rate of decay
is the critical rate determined by Bekesy (14), and the time to reach
threshold is shown on the abscissa by the intersections of the curves
with the zero-ordinate.






In auditory experiments, the practical problem often arises
as to what should be done about turning tones on and off in
such a way as to avoid a click and yet give the impression of
instantaneous starting and stopping. Figure 95 shows that the
ideal way to turn a tone off is to allow it to decay to threshold
intensity over a period of 0.14 sec. Likewise, the ideal build-up
time for turning a tone on is of the same order of magnitude,
although quantitative measurements of this effect appear not
to have been made.
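
An offset of this kind can be sketched as a gain that falls linearly in
decibels, that is, exponentially in amplitude, from the full level to
threshold in 0.14 sec. This is a minimal sketch under stated
assumptions: the function name is hypothetical, the tone is taken to
start 60 db above threshold purely for illustration, and switching the
tone off entirely once threshold is reached is our simplification.

```python
def offset_gain(t_s: float, sensation_level_db: float,
                fall_time_s: float = 0.14) -> float:
    """Amplitude gain (1.0 = full level) at time t_s after the offset
    begins. The level drops linearly in db and reaches threshold
    (0 db sensation level) at fall_time_s; thereafter the gain is zero."""
    if t_s <= 0.0:
        return 1.0
    if t_s >= fall_time_s:
        return 0.0
    drop_db = sensation_level_db * t_s / fall_time_s
    return 10.0 ** (-drop_db / 20.0)


# Gains for a tone beginning 60 db above threshold, every 35 msec.
print([round(offset_gain(0.035 * k, 60.0), 4) for k in range(5)])
```

Multiplying the samples of the tone by this gain gives the decaying
wave; a corresponding build-up could be obtained by reversing the ramp
in time, although, as noted, the ideal onset has not been measured.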



CHAPTER 9 


MODULATION, VIBRATO, AND BEATS

Whenever a characteristic of a sound wave is varied, the wave
is said to be modulated. Thus if we change the amplitude, the
frequency, or the phase of a sound, we produce amplitude,
frequency, or phase modulation. Even when we turn a tone
on and off, we are effectively modulating the tone, although
in practice the term modulation usually refers to a periodic
modification of a wave. All three types of modulation may
occur singly or together, but our chief concern will be with the
effects of amplitude or of frequency modulation occurring
alone.


THE NATURE OF MODULATION 

First, let us examine the nature of a modulated tone, in order
to see just what is the difference between frequency and
amplitude modulation. Suppose that we have an audio oscillator
producing a pure tone in a loud speaker. We could very easily
produce an amplitude modulation by wiggling back and forth
the dial controlling the intensity of the tone. Or we could
create a frequency modulation by turning the tuning control up
and down. An oscillographic record of the resulting sound
wave would look quite different in the two cases. Manipulation
of the intensity control would give us a wave whose height
from crest to trough varies with time, whereas turning the
frequency dial would produce waves which are alternately closer
together and farther apart. Suppose, now, that each dial is
turned back and forth in a simple periodic fashion, so that the
changes produced in the tone are sinusoidal with time. Then
we should be producing the simplest form of periodic
modulation. We could take an oscillographic record of these two
types of modulated waves and analyze them into their Fourier
components, in order to discover what steady tones would, when
mixed together, produce the same form of wave. These steady
tones make up what we term the acoustic spectrum of the wave.

Now, when it is discovered that a modulated wave can be
analyzed into a spectrum of steady components, the question
arises as to how we are to regard modulated tones. Are they
tones which vary continuously in either amplitude or frequency,
or are they groups of steady tones? The answer is that they are
both: they may be regarded from either point of view,
depending upon our purpose. The steady components are actually
present in a physical sense, as can be shown by the type of
analyzer capable of responding to certain individual frequencies
and of excluding others. On the other hand, when all the
frequencies in the spectrum of a modulated tone affect
simultaneously the same vibrating body, such as a microphone,
they force it into a form of vibration whose amplitude or
frequency varies continuously with time. In the example we have
been considering, the modulated tone was produced by varying
continuously a control on an oscillator. We could have produced
precisely the same tone by turning on several oscillators, each
tuned to a definite frequency, provided the tones from the
different oscillators had the proper phase and amplitude
relations.

When our purpose is to study the effects of modulated tones
upon the ear, it is most profitable, as we shall see, to regard the
modulated wave as a spectrum of steady components. The ear,
as a frequency analyzer, tends to respond to each component
separately. It fails to resolve these components completely one
from another, for the ear is an imperfect analyzer, but we can
understand its failures once we grasp the nature of its task.
Basic to this understanding is a knowledge of the spectrum of a
modulated tone.

A direct method of obtaining the component frequencies
in a modulated wave is by mathematical calculation (see
Appendix I). In the equation for a simple sinusoidal wave, we can
substitute, in place of the constant which stands for the
amplitude of the wave, a function that varies with time, and solve
the equation. We then obtain the frequency and amplitude
of all components of the wave whose amplitude is being
modulated. An analogous substitution and solution discloses the
components generated by frequency modulation. In both cases,
we discover that whenever we modulate the frequency or the
amplitude of a tone we obtain a complex spectrum consisting
of a central component, or band, with side bands distributed
symmetrically on either side. These side bands are spaced a
distance apart equal, in cycles per second, to the rate at which
the modulation occurs. Their relative amplitudes are a
function of the range of the modulation, where range is defined as
half the distance between the highest and the lowest frequency
or amplitude reached during any part of the modulation. Now,
when the amplitude of a tone is modulated sinusoidally, the
resulting spectrum contains the central component and only two
side bands, one above and one below the frequency of the
central component. However, when the frequency is modulated
sinusoidally, the number of side bands is theoretically infinite,
although, when the range of modulation is small, only those
side bands close to the central band have appreciable amplitude.
Thus, when the range is numerically less than half the rate, the
spectrum consists of only three appreciable components, just as
does the spectrum of a wave undergoing amplitude modulation.
Furthermore, there is the surprising fact that the three
components arising from frequency modulation may be identical in
frequency and intensity to those generated by amplitude
modulation.
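
The statement that only three components are appreciable when the
range is less than half the rate can be checked numerically. The
sketch below is not taken from Appendix I; it assumes the standard
result that, for sinusoidal frequency modulation, the relative
amplitude of the n-th pair of side bands is the Bessel function
J_n evaluated at the ratio of range to rate. The function names
and the illustrative values (range 0.97 cycle, rate 2 per second)
are assumptions chosen for the example.

```python
import math

def bessel_j(n, x, terms=40):
    """Bessel function of the first kind J_n(x), summed from its power series."""
    total = 0.0
    for k in range(terms):
        coeff = (-1) ** k / (math.factorial(k) * math.factorial(k + n))
        total += coeff * (x / 2) ** (2 * k + n)
    return total

def fm_sidebands(range_cycles, rate_per_sec, n_max=3):
    """Relative amplitudes of the central component (n = 0) and of the
    n-th pair of side bands for a sinusoidal frequency modulation."""
    index = range_cycles / rate_per_sec   # ratio of range to rate (assumed notation)
    return [abs(bessel_j(n, index)) for n in range(n_max + 1)]

# Illustrative case: the range is just under half the rate.
amps = fm_sidebands(range_cycles=0.97, rate_per_sec=2.0)
for n, a in enumerate(amps):
    print("side-band order", n, "relative to central:", round(a / amps[0], 3))
```

Under these assumptions the first pair of side bands comes out at
about one-fourth of the central component, and the second pair at
a few per cent of it, so that the spectrum is effectively one of
three components.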

In other words, we find that three different frequencies led
simultaneously to the ear may give rise, in one instance, to
the sensation of a tone waxing and waning in loudness, and the
same three frequencies may, in another instance, produce the
impression of a tone whose pitch rises and falls. How
are we to explain this effect? Since the components in the two
spectra are alike in amplitude and frequency, the only difference
possible between them is one of phase. And it is, in fact, just
a difference of phase which determines whether the three
components will summate to give a frequency modulation or an
amplitude modulation.

In order better to illustrate these important relations, let us
choose three components whose frequencies are 6, 8, and 10
cycles, and let the amplitude of the 8-cycle component be 4
times as large as the amplitudes of the two side bands. Then,
as shown in Fig. 96, we can arrange the phases of these
components in such a way that the three waves added together give
an amplitude modulation. However, if we shift the phase of
the central component by 90° we find that the three waves
summate to produce a frequency modulation. In Fig. 96 each of
the modulated waves was obtained by adding algebraically the
instantaneous amplitudes of the three waves drawn directly
under each. (Actually, in constructing the wave whose
frequency is being modulated, other side bands were neglected,
but the amplitude of the largest of these amounts to only 3 per
cent of the amplitude of the central band.)

Fig. 96. Showing how the same three components, 6, 8, and 10
cycles, can be added in different phase relations in order to
produce either an amplitude or a frequency modulation.
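
The addition described for Fig. 96 can be imitated in a few lines.
The sketch below is an illustration under stated assumptions, not
the construction used for the figure: the three components are
taken as cosine waves, the two side bands start in the same phase,
and the sum is referred to the 8-cycle component so that its
instantaneous amplitude and phase can be read off. The function
names are hypothetical.

```python
import cmath
import math

FREQS = (6.0, 8.0, 10.0)   # cycles per second, as in the text
AMPS = (1.0, 4.0, 1.0)     # central component 4 times each side band

def envelope_and_phase(t, central_phase_deg):
    """Add the three components as rotating phasors and refer the sum to
    the 8-cycle component, giving instantaneous amplitude and phase."""
    phases = (0.0, math.radians(central_phase_deg), 0.0)
    z = sum(a * cmath.exp(1j * (2 * math.pi * f * t + p))
            for a, f, p in zip(AMPS, FREQS, phases))
    z *= cmath.exp(-1j * 2 * math.pi * FREQS[1] * t)   # remove the 8-cycle rotation
    return abs(z), cmath.phase(z)

def swing(central_phase_deg, steps=2000):
    """Extent of the amplitude and of the phase variation over one
    modulation cycle (0.5 sec, since the side bands lie 2 cycles away)."""
    points = [envelope_and_phase(0.5 * i / steps, central_phase_deg)
              for i in range(steps)]
    env = [e for e, _ in points]
    ph = [p for _, p in points]
    return max(env) - min(env), max(ph) - min(ph)

for label, deg in (("phases as first arranged", 0.0),
                   ("central component shifted 90 deg", 90.0)):
    amp_swing, phase_swing = swing(deg)
    print(label, "| amplitude swing:", round(amp_swing, 3),
          "| phase swing (radians):", round(phase_swing, 3))
```

With the first arrangement the amplitude swings widely while the
phase stays fixed, which is an amplitude modulation. With the
central component shifted by 90° the amplitude is nearly constant
and the phase swings back and forth, which is (very nearly) a
frequency modulation; the small residual amplitude variation
corresponds to the neglected outer side bands.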

These interesting relations between the two types of
modulation obtain likewise when the two spectra are more complex,
and it is possible, in general, to transform a frequency
modulation into an amplitude modulation by readjusting the phases of
the components. If there are more than two side bands,
however, the amplitude modulation will be nonsinusoidal. At
intermediate phases we may obtain a combination of frequency
modulation and amplitude modulation. Thus, if we had
changed the phase of the center band (Fig. 96) by 45° we should
have created just such a hybrid modulation. We find, then,
that in specifying the spectrum of a modulated wave, we must
state the frequency, the amplitude, and the phase of each
component. All three of these variables are specified in the formulas
for modulated tones (see Appendix I).

Now, the role of phase turns out to be unexpectedly crucial
in these considerations, despite the well-accepted doctrine that
the ear does not take account of phase relations. We have
already encountered in Chapter 7 instances in which changes in
the phase relations of harmonic components produced
noticeable effects, but here we have even more dramatic evidence that
the ear may be extremely sensitive to the relative phases of the
components of a sound. Consequently, before proceeding to
a consideration of specific experiments involving modulation, it
may be profitable to inquire into the behavior of the ear under
the impact of a modulated tone.

Since any type of modulated wave can be analyzed into a
spectrum containing several steady components, the ear would,
if its tuning were sufficiently sharp, hear all the components
independently and simultaneously, just as it hears at once the
flute and the cello in an orchestra. Then, when the tuning
dial of an oscillator is turned back and forth, instead of hearing
a pitch which rises and falls, we should hear only a group of
steady tones spaced a certain distance apart. Even though the
tuning dial were moved continuously, so that, to all appearances,
the change in frequency is likewise continuous, there would be
certain frequencies which we should hear and intermediate
frequencies which we should not hear. Such is the nature of
frequency modulation: a continuous change in frequency
produces a discontinuous spectrum. Nothing, perhaps, is more
contrary to intuition than that we should be able to change the
frequency of an oscillator continuously between two limits
without producing all intermediate frequencies, but that is
precisely what we do when we generate a sinusoidal frequency
modulation. And if the ear were a better analyzer it would
tell us so.

As it is, the ear does not completely resolve the components
of the spectrum, and we hear a pitch which follows the
movements of our tuning dial, provided they are not too rapid. The
explanation of this fact is apparent: because the ear is not very
sharply tuned, each of the components stimulates a rather wide
area on the basilar membrane, and these areas of disturbance
overlap to some extent. It happens, in a frequency modulation,
that the components at one end of the spectrum are in such a
phase relation that, at one time during the modulating cycle,
these components all reinforce each other and cause the
maximum of the disturbance on the membrane to move toward their
location. At another part of the modulating cycle these
components are tending to cancel each other, while those at the
opposite end are enjoying mutual reinforcement, and so the
maximum of the disturbance finds itself in a new place. At
intermediate times it is located between these two extremes.
Now, if the location of this maximum is what determines the
pitch of a tone, it is plain why the ear perceives a slow rate of
frequency modulation as a continuous change of pitch, in spite
of the fact that the stimulus really contains only a set of discrete
tones. In other words, it is the beating among the components
of the spectrum which gives us the illusion of a continuously
changing frequency. (A schematic representation of this
process is shown in Fig. 32, p. 91, in connection with the
problem of the DL for frequency discrimination.)

What, then, is the nature of the effect under amplitude
modulation? Here again we can say that if the ear were a
better analyzer it would hear a group of steady tones whenever
someone turns the intensity control of an oscillator up and
down. Resolution is poor, however, and each component
stimulates an extended area on the basilar membrane. These
areas of disturbance alternately reinforce and interfere with one
another, just as they do under frequency modulation, but with
this one important difference: the maximum of the
disturbance is never displaced from its position over the center of the
spectrum. Phase relations are such that all the components
work symmetrically to reinforce or cancel the central
component without ‘skewing’ the pattern of disturbance.
Consequently, the ear hears a tone whose loudness rises and falls, but
whose pitch remains constant.

All these considerations pertain to modulations whose rates
are not more than about 6 per second. We shall see later that
additional complications may arise at faster rates, for then the
components are spaced farther apart on the basilar membrane
and the shift in the pattern of disturbance is too rapid to be
perceived as a change in pitch.

AMPLITUDE MODULATION: INTERRUPTED TONES

The last chapter showed how most of the experiments on
auditory persistence are really experiments on the effects of
amplitude modulation. Later we shall see how some of the
experiments on frequency modulation can be regarded as
experiments on persistence. Here, however, we shall investigate
experiments on interrupted tones.

In their well-known experiment, Weinberg and Allen made
an effort to interrupt a tone issuing from a closed box by means
of a rotating disk which had four symmetrically placed holes in
it. The tone was heard when the holes of the disk coincided
with a hole in the box. When the disk was rotated rapidly
enough, the interruptions were not detectable. The authors
therefore concluded that, when the interruptions are sufficiently
frequent, fusion occurs, as it does in vision.

Wingfield, however, repeated this experiment with more
adequate technique, but was unable to demonstrate fusion when
the interruption of the tones was complete, regardless of the rate
of interruption. By arranging his apparatus so that the
cut-off was only partial, allowing some sound to reach the observer
all the time, he could obtain fusion of the sort previously
reported.

Now, it is clear that these experimental conditions were such
as to produce an amplitude modulation. If the cut-off had
been instantaneous, the modulation would have been what we
term ‘square-topped,’ and the resulting spectrum would have
contained an infinite number of components distributed in
groups of continuous bands. As it was, the cut-off, although
not exactly sinusoidal, was sufficiently gradual so that the
spectrum probably contained a finite number of components, spaced
apart at a distance equal to the rate of the interruption. An
increase in the frequency of interruption would then serve merely
to move the components farther apart, and to increase the rate
with which they beat with one another. Hence, at no rate of
interruption, no matter how great, could we expect to free the
resulting sensation from the effects of these side bands, unless
the range of modulation were so small as to reduce these side
bands to a negligible intensity.

When Wingfield made the cut-off only partially complete,
the effect was simply to reduce the range of the modulation,
which in turn reduced the magnitude of the side bands. Then,
with these side bands sufficiently small, a point could be reached
at which the central band alone was perceived. This is the
point where fusion was supposed to have occurred, but the
present analysis shows us that there can be no question of the fusion
of successively discrete stimulations, because we are here dealing
exclusively with steady tones. The large central component
will always appear steady and unvarying to the listener
whenever the alternate reinforcement and cancellation it receives
from the side bands are less than the DL for intensity
discrimination. It will be recalled, in fact, that Riesz used essentially
this same method of interfering tones to measure DL's of
intensity (Chapter 5, pp. 136-141).

With rapid rates of modulation (rates equal to half the
frequency of the tone itself) it has been shown experimentally
that fusion does not occur. Kucharski was able to eliminate
every other cycle from a 200- and a 1000-cycle wave, a process
equivalent to a square-topped amplitude modulation at the rate
of 100 and 500 cycles, respectively. Both these modulated
tones, far from giving a clear impression of fusion, gave
sensations differing markedly from those of pure tones. The
interruptions in the 200-cycle tone were clearly perceptible as
apparent breaks, but with the 1000-cycle tone the impression was
more one of roughness. In both cases the central component
was still detectable, i.e., trained observers could detect a pitch
corresponding to 200 and to 1000 cycles.
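
The spectrum produced by removing every other cycle can be
sketched numerically. The example below is an idealization, not a
description of Kucharski's apparatus: it assumes a 200-cycle sine
wave that is present for exactly one cycle and absent for the
next, so that the pattern repeats 100 times per second, and it
computes the amplitude of each component by a direct Fourier sum
over one repetition period. The names used are hypothetical.

```python
import cmath
import math

def component_amplitude(samples, k):
    """Amplitude of the k-th multiple of the repetition rate, found by a
    direct Fourier sum over one repetition period."""
    n = len(samples)
    s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples))
    return 2 * abs(s) / n

N = 1000          # samples in one repetition period
TONE = 200.0      # cycles per second
PERIOD = 0.010    # one cycle on, one cycle off: 100 interruptions per second

wave = []
for i in range(N):
    t = PERIOD * i / N
    tone_on = t < PERIOD / 2          # first 5 msec: tone; next 5 msec: silence
    wave.append(math.sin(2 * math.pi * TONE * t) if tone_on else 0.0)

for k in range(1, 8):
    print(100 * k, "cycles:", round(component_amplitude(wave, k), 3))
```

In this idealized case the 200-cycle central component survives at
half its original amplitude, flanked by strong components at 100
and 300 cycles and weaker ones farther out, which agrees with the
report that the original pitch remains detectable although the
tone no longer sounds pure.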

DEMODULATION 

We have seen (in Chapter 7) that, when two tones are
present simultaneously, a difference tone may be heard.
Consequently, when the amplitude of a wave is modulated in
such a way as to produce three component frequencies, we
should expect the ear to hear a tone equal, in frequency, to the
difference between the components, provided the difference is
large enough. That the ear does behave in this manner can be
shown by modulating a 1000-cycle tone at the rate of 60 per
second (Stowell and Deming). In addition to the three
components generated by the process of modulation, one hears a
60-cycle tone, generated by the process of demodulation.
Demodulation occurs whenever a modulated wave is passed through
a distorting system which partially rectifies the wave. The
nonlinearity and asymmetry of the auditory mechanism
produce just this sort of rectifying effect, so that we tend to hear
the modulating frequency whenever this frequency is not
too low.
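
The process of demodulation can be illustrated with a simple
numerical sketch. What follows is only an illustration under
assumptions: the ear's asymmetrical distortion is represented by a
small squared term added to the wave (a hypothetical stand-in, not
a measured characteristic of the ear), and the depth of modulation
and the sampling rate are arbitrary choices.

```python
import cmath
import math

FS = 12000        # samples per second (assumed)
N = 600           # 0.05 sec: whole periods of both 60 and 1000 cycles
CARRIER, RATE, DEPTH = 1000.0, 60.0, 0.5   # DEPTH is an illustrative value

def amplitude_at(samples, freq):
    """Amplitude of the component at freq (a multiple of FS / N),
    found by a direct Fourier sum."""
    n = len(samples)
    k = round(freq * n / FS)
    s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples))
    return 2 * abs(s) / n

# A 1000-cycle tone whose amplitude is modulated 60 times per second.
modulated = [(1 + DEPTH * math.cos(2 * math.pi * RATE * i / FS))
             * math.cos(2 * math.pi * CARRIER * i / FS) for i in range(N)]

# Hypothetical asymmetrical distortion: a small squared term added to the wave.
distorted = [x + 0.2 * x * x for x in modulated]

for name, wave in (("modulated wave", modulated), ("after distortion", distorted)):
    print(name, "| 60-cycle component:", round(amplitude_at(wave, RATE), 4))
```

Before distortion the wave contains no energy at 60 cycles, only
the central component and its two side bands. After distortion a
60-cycle component appears, and with this squared term its
amplitude is proportional to the depth of modulation, which is in
keeping with the finding that the loudness of the modulating tone
depends on the range of the amplitude modulation.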

Stowell and Deming were able to show that the loudness of
the modulating tone (60 cycles) is, as we should expect, a
definite function of the range of the amplitude modulation. No
audible demodulation occurred, in their experiments, when the
range of modulation was less than 4 per cent of the amplitude
of the 1000-cycle tone. They also found that the frequency of
the modulated tone is an important factor in determining how
loud the 60-cycle tone will sound. In other words, more
demodulation occurs at some frequencies than at others. In their
experiment, demodulation was most prominent for tones near
1000 cycles, and was absent at very low and at very high
frequencies.

FREQUENCY MODULATION: THE VIBRATO

The vibrato has long been utilized as a melodic
embellishment that can be added to any note. The singer and the
violinist, in particular, find that mastery of the vibrato is an important
refinement of their art. With violin music it is clear what the
physical nature of the vibrato is, for the violinist produces this
musical effect by a rapid alteration of the length of a vibrating
string by the movement of his finger, thus producing a
frequency modulation. What the singer creates in the way of
modulation becomes clear only by the analysis of his actual
vocal production. The extensive studies carried out at the
University of Iowa (Seashore) have yielded analyses of the
vibrato as it is developed in the renditions of recognized artists.
These investigations confirm the impression that the vibrato is
essentially a frequency modulation, regardless of whether it is
produced by instrument or by voice, but they reveal the
additional fact that it is not uncommon to find present a small degree
of amplitude modulation as well.

The average rate of the vibrato is about 7 fluctuations per
second, but the rate among the better musicians is higher than
among the less skilled. The range of the vibrato among
violinists is about one-eighth of a tone (where range is defined
as half the total extent), and among singers it is about
one-fourth of a tone. Singers, however, show less uniformity than
violinists.
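
For comparison with ranges stated in cycles, these musical
fractions can be converted to cycles per second at a given
frequency. The sketch below assumes that a "tone" means a whole
tone of two equal-tempered semitones, and it reckons the range
upward from the mean frequency; both choices are assumptions made
for the illustration, and the 500-cycle frequency is merely a
convenient example.

```python
def range_in_cycles(frequency, fraction_of_tone):
    """Half-extent of a vibrato, in cycles per second, for a range given as
    a fraction of a whole tone (assumed to be 2 equal-tempered semitones)."""
    semitones = 2.0 * fraction_of_tone
    return frequency * (2.0 ** (semitones / 12.0) - 1.0)

for label, fraction in (("violinists", 1 / 8), ("singers", 1 / 4)):
    print(label, round(range_in_cycles(500.0, fraction), 1), "cycles at 500 cycles")
```

On these assumptions a range of one-eighth of a tone at 500 cycles
amounts to roughly 7 cycles, and a range of one-fourth of a tone
to roughly 15 cycles.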

These are the essential facts pertaining to the vibrato as it
occurs in musical practice. The concern of the
psychophysiologist is not so much a matter of what the artist does as what the
effect of his performance is upon the ear. How does the ear
respond to modulations of rate and range used by musicians,
and what are the critical values of these two variables in the
production of certain subjective effects?





As an example of the kind of problem arising in frequency
modulation, let us consider a specific instance. We can arrange
a rotary condenser in the plate circuit of an audio-oscillator,
so that, as the condenser revolves, the capacitance changes
periodically and produces a periodic change in the frequency of
oscillation of the oscillator. Such a device gives us a practical
method for imposing a frequency modulation upon a tone.
Now, suppose we adjust the condenser to give a range of
modulation equal to 10 cycles when the oscillator is generating a
500-cycle wave. The frequency would then vary continuously and
sinusoidally (given the proper form of condenser plates)
between 490 and 510 cycles. The condenser, we shall assume, is
being driven by a motor, and makes 1 revolution per second.
What would the resulting tone sound like? The listener
would hear a pitch which appeared to rise and fall
continuously, over an extent equal to about a semitone. The actual
stimulus, we know, consists here of a rather complex spectrum
containing many individual tones spaced 1 cycle apart, but
after these components reach the basilar membrane they
reinforce and cancel one another in just the right sequence to cause
the maximum of the disturbance on the membrane to move
back and forth.

Now, let us increase the speed of the rotary condenser. As
the rate of modulation rises, what we observe is that the pitch
of the tone moves up and down more and more rapidly, but,
with continued increase, a point is finally reached where the
change in pitch vanishes, leaving instead an apparent
intermittent change in intensity. The critical rate at which the ear
begins to hear a single pitch, composed of intermittent
pulsations, is about 6 alternations per second in the example we are
discussing. Further increase in the rate to as high as 12 per
second leads to an experience of a group of tones rather than a
single pitch. The side bands in this instance would be found 12
cycles apart.
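
The change in the spectrum as the rate rises from 1 to 12 per
second, with the range held at 10 cycles, can be sketched as
follows. The calculation assumes the standard result that the
relative amplitude of the n-th pair of side bands is the Bessel
function J_n of the ratio of range to rate; the 5 per cent
cut-off used to decide which side bands count as "appreciable" is
an arbitrary choice, and the function names are hypothetical.

```python
import math

def bessel_j(n, x, terms=60):
    """Bessel function of the first kind J_n(x), summed from its power series."""
    total = 0.0
    for k in range(terms):
        coeff = (-1) ** k / (math.factorial(k) * math.factorial(k + n))
        total += coeff * (x / 2) ** (2 * k + n)
    return total

def appreciable_components(range_cycles, rate_per_sec, floor=0.05, n_max=40):
    """Distances (in cycles from the central component) of the side bands
    whose relative amplitude exceeds `floor` (an arbitrary cut-off)."""
    index = range_cycles / rate_per_sec
    return [n * rate_per_sec for n in range(1, n_max + 1)
            if abs(bessel_j(n, index)) > floor]

for rate in (1.0, 12.0):
    offsets = appreciable_components(range_cycles=10.0, rate_per_sec=rate)
    print("rate", rate, "per sec | side bands at +/-", offsets, "cycles")
```

Under these assumptions the slow modulation yields many closely
spaced side bands reaching somewhat beyond the nominal 10-cycle
range, whereas at 12 per second only a few widely spaced side
bands remain, which fits the report of a group of separate tones.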

Another approach to the problem is to set the rate of
modulation at a fixed value and change the range by altering the
distance between the plates of the condenser. This procedure
would not change the position of the side bands in the spectrum
of the tone, but it would alter their relative amplitudes. As
the range increases, the amplitude of the outer side bands grows
larger, making the spectrum effectively wider. How does the
width of the spectrum affect our sensation? Within certain
limits, it appears to determine the richness of a tone. Thus,
as the range is increased steadily from zero, a point is eventually
reached at which the tone appears, to the musical ear,
maximally rich. Then, after further increase, the richness gives way
to an experience of increased complexity.

Both these approaches were used by Ramsdell in order
systematically to determine the critical values of rate and range
for maximal richness and for singleness of pitch. He employed
trained musicians as observers, because the judgment is
essentially a musical one, and he gave them instructions, at one time,
to increase the rate of modulation until they achieved a tone of
apparently unitary pitch, such as would be satisfactory in a
singing voice. At another time they were asked to vary the
range until they obtained maximal richness. The results for
four different frequencies are shown in Fig. 97. The circles
show the values obtained when the rate was increased from a
low value up to the rate which just gave singleness of pitch.
The upper part of the curve has been dotted to represent the
rates at which a gliding pitch is no longer detected and all that
remains is a complex mass of tones. The almost vertical lines
represent the results of adjusting the range of modulation to
give maximal richness. At the intersections of the two
functions we have what might be called the richest, singlest note
obtainable under frequency modulation.

How, then, do these critical values of rate and range
compare with those of actual vibratos produced by good musicians?
On the plot for the 500-cycle tone in Fig. 97 are indicated the
rate and range of a group of 20 voices studied by Metfessel.
About half his cases fell within the limits of this circle
(provided we may assume that all the notes he studied were sung at
500 cycles). Here we see that most vocal vibratos are just fast
enough to produce a note which appears unitary in pitch, and
that they cover very nearly the optimal range for maximal
richness. The violinists studied by Hollinshead produced vibratos,
most of which fell within the oval figure. The rates are very
nearly the same as those of the singers, but the range is definitely
smaller. The rates could be lower without the listener's being
able to hear the tone as having a gliding pitch, but the range
would have to be almost doubled to obtain maximal richness.

Fig. 97. The critical rates and ranges of frequency modulation
producing singleness of pitch (circles) and maximal richness
(vertical dotted lines). (After Ramsdell.) In the plot for 500
cycles, the large circle represents the rate and range of vibrato
in the voices of accomplished singers, and the oval shows the rate
and range of vibrato produced by expert violinists. Abscissa:
range of modulation in cycles.

Explanation of all the effects of frequency modulation
cannot be made at present. Probably the most interesting problem
demanding clarification is why, as the rate of modulation is
increased, we go from a situation where the pitch is obviously
gliding up and down to one in which the only thing apparent
is a series of intermittent impulses resembling rapid beats. At
slow rates, the steady components beat with each other and
cause the maximum of the disturbance on the basilar membrane
to glide back and forth, in the manner already indicated. That
much is clear. But then, as the rate is increased, although the
maximum continues to move back and forth, the movement no
longer appears as a change of pitch. After the rate reaches 7
per second, no matter how extensive the excursion of the
maximum on the membrane (no matter how wide the range of
modulation), its gliding character is lost. Hence, 7 per second
appears as the limiting value for the perception of this
phenomenon. In terms of the two limiting positions of the
maximum of the disturbance, it would appear that if they succeed
each other less often than 7 times per second they can be
perceived as occurring successively in time. At faster rates they
appear in perception as occurring simultaneously. (This rule
holds for wide ranges of modulation, where the end positions of
the disturbance are far apart.) In other words, if the disturbance
alternates between two positions within 0.14 sec, it does not
appear successive. The figure of 0.14 sec reminds us that
Békésy reported that the persistence of an auditory sensation
lasts about this length of time (p. 222). Hence, it is not
unreasonable to suppose that the two stimulations, due to the
disturbance when it is at the two ends of its excursion, are perceived
as simultaneous when they have not had time to die out to
some definite value (not necessarily zero) before stimulation
recurs. Just what this value is has not, as yet, been determined.

This unproved, but suggestive, relation between the critical
rate of modulation and auditory persistence would mean that,
whereas most experiments on persistence have turned out to be
experiments on amplitude modulation, certain experiments on
frequency modulation yield information relative to the decay
of auditory sensations.

Now, it is clear from Fig. 97 that the experience of a gliding
pitch may vanish at rates below 7 per second, but that then
the range is definitely smaller and the disturbance on the basilar
membrane does not move so far. This fact is apparent if we
plot the spectra of the modulations giving singleness of pitch
for the 500-cycle tone. These diagrams, presented in Fig. 98,
reveal how much narrower is the spectrum which gives unitary
pitch at a rate of 4.5 cycles than that which does not appear
unitary until the rate of 7 cycles is reached. Another important
difference between these spectra is the predominance of the
central component at the lower rate. When this component is
large, and when the spectrum is narrow, the maximum of the
disturbance moves back and forth, but, when the maximum
is at one end of its excursion, the part of the membrane located
at the position corresponding to the other end of the excursion
is still being stimulated almost to the maximal extent. In other
words, when the range of modulation is narrow, the difference
between maximal and minimal stimulation at any one place
on the basilar membrane is not so large as when the range is
wide, and the maximum is, therefore, not so prominent. Under
these conditions a relatively slow rate will obscure the
movements of the maximum, and the pitch will not appear to glide
up and down.

Fig. 98. The spectra for a 500-cycle tone whose frequency is
modulated at the rates and ranges indicated. These rates and
ranges are the critical ones which produce singleness of pitch.


THE PITCH OF FREQUENCY-MODULATED TONES

Tiffin and Seashore summarized the earlier work on the
vibrato with a statement to the effect that the vibrato, due to
frequency modulation, is heard as one salient pitch
corresponding very nearly to the mean frequency of the modulation, and
that, when the range of the vibrato is wide, the pitch is less
accurately determined. A consideration of the spectra of
satisfactory musical vibratos would lead us to believe that the pitch
of all vibratos is less certain than that of a single pure tone.
Furthermore, it is quite possible that a very useful aspect of the
vibrato, from the musician's point of view, is precisely this
uncertainty of pitch, for it covers up slight errors in tuning. If
a singer with a good vibrato sings slightly off key, the audience
will be unaware of it.

Since a modulated tone has a spectrum composed of several
steady tones, we are led to ask whether any of the components
in a vibrato can be heard separately. In order to investigate this
problem, three tones were modulated at the rate of 8 per second
(Youtz and Stevens). The ranges were so chosen that the
central component of the resulting spectrum was twice as large as,
equal to, and half as large as the two adjacent side bands. These
ranges were 15, 22, and 29 cycles, respectively, and the central
component had a frequency of 1000 cycles in each case. The
observers adjusted the frequency of a steady tone until it
sounded equal in pitch to the modulated tone. Figure 99 shows
the distributions of the settings. The wider scatter of the judgments
at the wider ranges demonstrates that pitch becomes less
certain as the extent of the vibrato is increased.

Fig. 99. The distributions of judgments of observers who set a pure tone to
equal a tone (1000 cycles) modulated in frequency at the rate of 8 per second.
The ranges of modulation are indicated on each plot. (After Youtz and
Stevens.)
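The three ranges of modulation can be checked against the elementary theory of frequency modulation. The following Python sketch rests on two assumptions that are not stated in the text: that the modulation was sinusoidal, and that the quoted range is the total excursion of the frequency, so that the modulation index is half the range divided by the rate of modulation. On these assumptions the central component is proportional to the Bessel function J0 of the index and each adjacent side band to J1. The function names are illustrative only.

```python
import math


def bessel_j(n, x, terms=30):
    """Bessel function of the first kind, J_n(x), summed from its power series."""
    total = 0.0
    for k in range(terms):
        total += ((-1) ** k) * (x / 2.0) ** (2 * k + n) / (
            math.factorial(k) * math.factorial(k + n)
        )
    return total


def central_to_side_band_ratio(range_cycles, rate_per_second):
    """Ratio of the central component to either adjacent side band.

    Assumption: sinusoidal modulation, with `range_cycles` taken as the
    total (peak-to-peak) excursion, so the index is (range / 2) / rate.
    """
    index = (range_cycles / 2.0) / rate_per_second
    return bessel_j(0, index) / bessel_j(1, index)


RATE = 8.0  # modulations per second, as in the experiment described above
ratios = {r: central_to_side_band_ratio(r, RATE) for r in (15.0, 22.0, 29.0)}

for range_cycles, ratio in ratios.items():
    print(f"range {range_cycles:2.0f} cycles: central / side band = {ratio:.2f}")
```

Under the stated assumptions the ratios come out at roughly 1.9, 1.1, and 0.6, which agrees tolerably well with the description of the central component as twice as large as, equal to, and half as large as its neighbors.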




Closer analysis of the results from individual observers in
this experiment revealed evidence that, when the range of modulation
was 29 cycles, the separate components of the spectrum
stood out sufficiently to cause occasional close groupings of the
settings around one of the two large side bands. Even more
direct, however, is the evidence from the verbal reports of the
observers. When they raised the frequency of the test tone up
to the value of pitch which they thought they detected in the
modulated tone, they found that the pitch of the modulated
tone had apparently moved still higher. Then, when they
moved the pitch of the test tone on up to the new pitch in the
modulated tone, they found that this pitch had unaccountably
shifted back to its original value. In other words, whenever
they had set the test tone to the pitch of one of the large components
of the vibrato, the pitch of the other component stood
out so clearly as to make the setting seem erroneous. The observers
never could make a single pure tone match, at one time,
all the pitches in the modulated tone. From this fact, we may
conclude that, when the range of a vibrato is sufficiently wide,
the individual components of the tone stand out well enough
to be identified separately.


BEATS 

Whenever two tones of nearly the same frequency are
sounded together, they produce beats at a rate equal to the difference
between their frequencies. Beats occur because the
continuous change in the relative phase of the two tones leads
to alternate periods of reinforcement and cancellation. However,
beats do not occur unless the two tones affect the same
system. If the ear were a really perfect analyzer of sound, if
Ohm's law held exactly, we should never perceive beats. It is
only because the two tones force into vibration overlapping regions
of the basilar membrane that an alternate waxing and
waning of sound is heard. This lack of sharp tuning in the
ear also underlies, as we have already noted, the effects produced
by modulated tones. In fact, we can classify beats as a kind of
hybrid modulation in which the spectrum contains only two
components. Beats are a combined amplitude and frequency
(or phase) modulation.
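The arithmetic of reinforcement and cancellation may be made concrete with a short Python sketch. It treats the two tones as cosine waves acting on one and the same linear system and computes the envelope of their sum from the changing phase difference. The frequencies 1000 and 1003 cycles and the unit amplitudes are chosen merely for illustration, and the function names are not taken from any source.

```python
import math


def beat_rate(f1, f2):
    """Beats per second: the difference between the two frequencies."""
    return abs(f2 - f1)


def beat_envelope(a1, a2, f1, f2, t):
    """Envelope of a1*cos(2*pi*f1*t) + a2*cos(2*pi*f2*t) at time t.

    The relative phase of the two tones advances at the rate (f2 - f1);
    the resultant amplitude follows from adding the two as phasors.
    """
    phase_difference = 2.0 * math.pi * (f2 - f1) * t
    squared = a1 * a1 + a2 * a2 + 2.0 * a1 * a2 * math.cos(phase_difference)
    return math.sqrt(max(0.0, squared))


f1, f2 = 1000.0, 1003.0          # illustrative frequencies, in cycles
rate = beat_rate(f1, f2)         # 3 beats per second
period = 1.0 / rate

reinforced = beat_envelope(1.0, 1.0, f1, f2, 0.0)           # tones in phase
cancelled = beat_envelope(1.0, 1.0, f1, f2, period / 2.0)   # phase-opposition

print(rate, reinforced, cancelled)
```

With equal amplitudes the envelope swings between the sum of the two amplitudes and zero once in every beat period, which is the alternate waxing and waning described above.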

When two tones are sounded simultaneously and the difference
between their frequencies is gradually increased from zero,
three successive stages of the phenomenon are distinguished:
(1) the loudness appears to surge up and down continuously,
(2) the beats are heard as a series of intermittent impulses, and
(3) there is roughness without intermittence. The boundaries
between these stages are not sharp, but the character of the
sensation within each stage is quite distinct (Wever, 1).

The first stage begins at an indefinitely low rate. The slowest
rate of beating that can be detected is probably determined
only by the patience of the listener. Wever reports listening
to beats as slow as one in two minutes. At that rate he heard
the tone rise and fall very slowly in loudness, and these rises
and falls were separated by periods of complete silence. When
the rate is increased to the vicinity of 2 or 3 beats per second,
we find that the waxing and waning of loudness is very prominent.
This is the rate at which beats are most easily detected
(cf. Fig. 53, p. 137).

At the rate of about 6 or 7 beats per second, where the second
stage begins, the smooth rise and fall in loudness vanishes
and each beat appears as a single impulse. We have already
seen that, at this same rate of modulation, the pitch of a vibrato
ceases to rise and fall as it does at lower rates. Beyond this
critical rate, we are left, in both instances, with a tone having
an intermittent, throbbing character. So similar are the sensations
in the two instances that the ear has great difficulty in distinguishing
certain vibratos from rapid beats.

Then, as the rate of beating increases further, the intermittent
aspect gives way to roughness, and the third stage begins.
The rate where this transition occurs is indeterminate. For
one thing, it depends upon the frequency of the beating tones.
Wever places the rate at about 166 for tones in the neighborhood
of 1000 cycles.

The upper limit for the perception of beats, in the form of
roughness, also cannot be set with precision, but the evidence
clearly indicates that this limit is higher when the beating tones
are of high frequency. Intensity is also a factor in determining
this limit, although systematic studies of its effect at different
frequencies appear not to have been undertaken.

Regarding the apparent pitch of a beating complex, Wever
was able to conclude, from a consideration of most of the evidence
gathered by previous writers, that: (1) When the difference
between the frequencies of the two primary tones is low,
the perceived pitch appears to lie between the primaries. The
tone whose pitch is thus perceived is called the intertone. (2)
With a greater difference in frequency, the two primaries step in
beside the intertone. This change probably occurs at about 8
beats per second. (3) With still greater difference in frequency,
the intertone disappears and the primaries alone remain. (4)
When the primaries are sufficiently separated in frequency so as
not to stimulate overlapping regions on the basilar membrane,
these two tones are perceived as clearly distinct and without any
roughness due to beating.

From our general notions regarding the spread of disturbance
in the cochlea, we should anticipate that the amount
of overlapping would be less for small than for large intensities.
In fact, tones which normally give good beats may cease to do so
when they are both made very weak. Casual observation has
demonstrated this possibility. On the other hand, when two
tones are very near in frequency and both are slightly below
the auditory threshold in intensity, they may become audible
during the period when their phases are such as to reinforce
one another. Under these conditions, two inaudible tones produce
audible beats.

The general rule regarding intensity is that beats are maximally
prominent when the intensities of the primaries are
equal. It is only then that two tones activating the same system
can completely cancel each other during the period of
phase-opposition. When one tone is less intense than the other,
the cancellation is only partial, and, when the net change in
amplitude of the stronger tone due to interference from the
weaker tone does not attain a certain minimal value, no beats
are heard. The determination of this minimal value was precisely
the goal of Riesz's experiment, which we discussed in
connection with DL's for intensity (pp. 136-141).
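How the depth of the beat shrinks as one tone is weakened can be shown by a few lines of Python. The sketch assumes only the simple addition of two sinusoids in a linear system: if the weaker tone has r times the amplitude of the stronger, the resultant swings between (1 + r) and (1 - r) times the stronger amplitude. It does not assume any particular value for the minimal detectable change, which must be taken from the measurements cited above. The second function gives the rise in peak amplitude when two equal tones reinforce one another, which is the margin by which two slightly subliminal tones may be lifted during reinforcement.

```python
import math


def beat_level_swing_db(weaker_over_stronger):
    """Swing in level (db) between reinforcement and phase-opposition.

    `weaker_over_stronger` is the amplitude ratio r, with 0 <= r <= 1.
    For r = 1 the cancellation is complete and the swing is unbounded.
    """
    r = weaker_over_stronger
    if not 0.0 <= r <= 1.0:
        raise ValueError("amplitude ratio must lie between 0 and 1")
    if r == 1.0:
        return math.inf
    return 20.0 * math.log10((1.0 + r) / (1.0 - r))


def reinforcement_gain_db():
    """Rise in peak amplitude (db) when two equal tones are exactly in phase."""
    return 20.0 * math.log10(2.0)


# Illustrative level differences between the two primaries, in db.
for difference_db in (0.0, 6.0, 20.0, 40.0):
    r = 10.0 ** (-difference_db / 20.0)
    print(f"{difference_db:4.0f} db apart: swing = {beat_level_swing_db(r):.2f} db")

print(f"equal tones in phase: peak rises {reinforcement_gain_db():.2f} db")
```

When the weaker tone lies 20 db below the stronger the level swings by less than 2 db, and at 40 db by less than 0.2 db, so that the beat dwindles rapidly as the intensities become unequal.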

It must be noted that, just as two primary tones may produce
beats, so may any of the harmonics of these tones, regardless
of how the harmonics are generated. Particularly interesting
is the possibility of hearing beats between various harmonics of
two tones when the harmonics are generated in the ear (see
Chapter 7). The presence or absence of roughness created by
the beating of aural harmonics may determine, according to the
well-known theory of Helmholtz, whether two objectively pure
tones appear consonant or dissonant when sounded together.

Finally, one may consider the relation between beats and
difference tones. Since they both occur at the same frequency,
people have sometimes tended to confuse them, and to speak
of the difference tone as though it were merely the tone perceived
as a result of rapid beats. Actually, however, beats and
difference tones have essentially nothing to do with one another.
They are produced by two entirely different principles. If the
ear were a more sharply tuned analyzer, we should hear no
beats, but we should still hear difference tones. On the other
hand, if the ear were a linear system, producing no distortion,
we should hear no difference tones, but we should still hear
beats. As it is, we hear both beats and difference tones simultaneously.
Let us note the effect in the cochlea when two tones,
2000 and 2200 cycles, are sounded simultaneously. The two
tones stimulate regions on the basilar membrane which overlap
to some extent, and in this region of overlap interference occurs
and produces beats. However, during the transmission of the
two tones to the inner ear, distortion has occurred, producing a
new frequency, the 200-cycle difference tone. This component
frequency activates a region of the membrane near the helicotrema
which vibrates quite independently of the beating that
takes place elsewhere.
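The distinction can be illustrated numerically. In the Python sketch below the tones of 2000 and 2200 cycles are added and then analyzed for a component at 200 cycles, first as they stand and then after passing through a weak quadratic distortion. The form of the distortion, y = x + 0.1 x^2, is a hypothetical stand-in chosen for simplicity and is not offered as a description of the ear; the sampling rate and duration are likewise arbitrary.

```python
import cmath
import math

SAMPLE_RATE = 16000  # samples per second (arbitrary choice for the sketch)
N = 1600             # 0.1 second: a whole number of cycles of every component


def two_tones(f1=2000.0, f2=2200.0):
    """Sum of two unit-amplitude cosine tones."""
    return [
        math.cos(2.0 * math.pi * f1 * n / SAMPLE_RATE)
        + math.cos(2.0 * math.pi * f2 * n / SAMPLE_RATE)
        for n in range(N)
    ]


def distort(signal, epsilon):
    """Hypothetical weak quadratic distortion: y = x + epsilon * x**2."""
    return [x + epsilon * x * x for x in signal]


def amplitude_at(signal, frequency):
    """Amplitude of the component at `frequency` (a single-bin Fourier sum)."""
    total = sum(
        x * cmath.exp(-2j * math.pi * frequency * n / SAMPLE_RATE)
        for n, x in enumerate(signal)
    )
    return 2.0 * abs(total) / len(signal)


linear = two_tones()
distorted = distort(linear, 0.1)

linear_200 = amplitude_at(linear, 200.0)        # linear system: nothing at 200
distorted_200 = amplitude_at(distorted, 200.0)  # distortion: a 200-cycle tone

print(f"{linear_200:.6f} {distorted_200:.6f}")
```

The undistorted sum beats 200 times per second, yet its spectrum holds no energy at 200 cycles; the component at 200 cycles appears only when distortion is introduced, with an amplitude equal to the coefficient of the quadratic term.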





THE MEASUREMENT OF ROUGHNESS 
Although, in discussing the three stages which can be detected
in the phenomenon of beats, Wever has distinguished
roughness (the third stage) and intermittence (the second
stage), this distinction is very much dependent upon the criteria
of judgment brought to bear by the listener. It is quite plain
that the stage of intermittence could equally well be characterized
as "rough." Indeed, whenever a modulation occurs at a
rate faster than about 20 per second, it engenders a sensation of
roughness, and this roughness persists, although undergoing
several qualitative transformations, until the rate reaches a high
value.

Now, if we could select a suitable standard of roughness, it
should be possible to determine the relative roughness of various
sensations. Békésy (21) adopted as a standard the tones 3000
and 3050 cycles sounded simultaneously at equal intensities.
The roughness of the resulting beats he could then control by
increasing or decreasing the total intensity of the beating complex,
and the roughness of any other sound he could rate in
units of this intensity, which he measured in terms of the maximal
sound pressure of the beating tones. The ability of observers
to give verifiable results justifies Békésy's choice of this
standard.

Fig. 100. Showing how the roughness of tones (measured by the ordinate)
changes with the rate of modulation. (After Békésy, 21.)

In order to show the relation between roughness in an amplitude
modulation and the rate of the modulation, Békésy equated
his standard to three modulated tones, each having an intensity
of 10 dynes per square centimeter. The results are shown in
Fig. 100. The roughness rises to a maximum in each instance
and then falls off with increasing rates of modulation. The low
tone (200 cycles) loses its roughness more rapidly than either of
the higher tones. This finding is consistent with the fact that
the maximal rate for the perception of beats is higher at the
higher frequencies.

Békésy's technique for the measurement of roughness also
enables us to determine the effect on a beating complex of a
change in the relative intensities of the two beating tones.
Thus, when 750 and 800 cycles are sounded together at an intensity
of 10 dynes per square centimeter, they exhibit a degree
of roughness measured by the intensity of the standard combination
(3000 and 3050 cycles), as pictured in Fig. 101. Then, as
the intensity of one of the tones (750 cycles) is reduced to the
various values shown on the abscissa, the roughness declines in
such a way that the intensity of the standard must be reduced
in order to hold the two roughnesses equal. At the limit, we
might expect that, with sufficient reduction in the intensity of
the 750-cycle tone, all roughness would disappear, and then, in
order to be equally rough, the standard would need to be
extinguished.

Fig. 101. Showing how the roughness of the beating of two tones, 750 and
800 cycles, changes with the intensity of the lower (750-cycle) tone. The
800-cycle tone had an intensity of 10 dynes per square centimeter. Abscissa:
intensity of lower tone (dynes per square centimeter). (After Békésy, 21.)



CHAPTER 10 


THE MECHANICS OF THE EAR 

We have studied, in the previous chapters, the nature of our
sensations when we listen to auditory stimuli. We have examined
the relation between our discriminatory responses and
the dimensions of the stimulating sounds. These studies have
been designed principally to answer the question, "What do we
hear when we listen?" We shall now turn our attention more
particularly to the mechanical and physiological processes involved
in our auditory responses. We shall endeavor to see
how the ear responds to sound waves and converts them into
neural events which underlie the discriminatory reactions which
we call hearing. In other words, we shall try to answer the
question, "How do we hear when we listen?"

Sound waves are a type of atmospheric disturbance whose
detection requires a specialized kind of mechanical system.
The ear is precisely such a system. It is a very delicate and
highly complicated mechanical device, certainly the most remarkable
mechanical system in the human body. Its astounding
sensitivity to minute disturbances, its ability to acquaint
the brain with displacements of the eardrum which are smaller
than the diameters of molecules (Chapter 2), and its power of
resolving complex wave forms into their Fourier components
make the ear a masterpiece of mechanical engineering. In
order properly to understand the action of the auditory mechanism
under the impact of sound waves, we must investigate it
from the point of view of (1) its mechanical properties and
(2) its function as a transducer capable of converting mechanical
energy into nerve impulses. The nature of the latter function
is disclosed chiefly, as we shall see later, by the electrical
effects which accompany it. The mechanical properties of the
ear we must understand, for the most part, by appealing to
known mechanical principles, on the assumption that they hold
true in a system like the ear. This procedure necessitates careful
anatomical measurement of the parts of the ear, especially
of the middle and inner ear. In this chapter we shall review
the essential anatomy of the ear and consider how it behaves
as a mechanical system.

THE ANATOMY OF THE MIDDLE EAR 
The external auditory canal of the human ear is about 2.5 cm
long and 0.7 cm in diameter. It is closed at its inner end by
the eardrum, or tympanic membrane, a cone-shaped structure
with its apex directed inward, which is placed somewhat
obliquely across the end of the canal. Internal to this membrane
lies the irregular-shaped cavity of the middle ear, between
1 and 2 cc in volume, containing the three ossicles (malleus,
incus, and stapes) and their supporting ligaments. In the
medial wall of the middle ear cavity are two openings in the
temporal bone, giving access to the inner ear. These, from their
shapes, are known as the oval window and the round window.
The round window is covered by a membrane, and the oval
window is filled by the footplate of the stapes, which is held in
place by elastic ligaments. Another opening into the middle
ear cavity is the Eustachian tube, which connects with the
nasopharynx. Ordinarily the Eustachian tube is closed at its
lower end, but it regularly opens during the act of swallowing
and thereby allows equalization of any difference in pressure on
the two sides of the tympanic membrane. Figure 102 is a
schematic diagram of the middle ear and also the vestibule,
semicircular canals, and cochlea. The cochlea is here represented
as straight, although actually it is coiled in the form of a
snail shell of two and one-half turns.

Figure 103A is a photograph of the three human ossicles.
They are normally oriented one to another approximately as
shown in the photograph, but the articular surfaces have there
been separated. As Fig. 102 indicates, the handle of the
malleus is firmly attached to the tympanic membrane, and its
lower tip lies practically at the center of the membrane. Viewed
through the external canal, the tympanic membrane appears
as a circle. Under proper illumination, the handle of the malleus
can be seen through the membrane as a vertical radius.
Malleus and incus articulate closely with one another by means
of the large irregular surfaces illustrated in the figure. The
joint is bound tightly by ligaments, and both ossicles are
attached to the walls of the middle ear by elastic ligaments, so
that they are free to vibrate in response to movement of the
handle of the malleus. The long process of the incus articulates
with the head of the stapes. The footplate of the stapes is
snugly sealed in the oval window by another elastic ligament.
(For the dimensions of the ossicles see Stuhlman or Piersol.)

Fig. 102. Schematic diagram of the internal ear. The cochlea is represented
as straight instead of coiled. (After Békésy, 19.)

Two muscles attach to the ossicles. The smaller of these,
the stapedius, is attached to the head of the stapes close to its
articulation with the incus. Its contraction draws the head of
the stapes outward and downward in a direction opposite to
the inward and upward movement of the long process of the
incus caused by increase of pressure on the outside of the tympanic
membrane. The other muscle, the tensor tympani, attaches
to the handle of the malleus and draws it inward, thereby
placing the tympanic membrane under tension. The effect of
its action on the stapes is to force the footplate upward and
inward into the oval window, and its action is thus antagonistic
to the action of the stapedius. When both muscles contract
simultaneously, as they usually do, the effect is to bring the
ossicles into closer approximation, to draw the tympanic membrane
inward and increase its tension, and, since the tensor
tympani is apparently more powerful than the stapedius, to
force the footplate of the stapes inward.



Fig. 103. Left: Photograph of the malleus, incus, and stapes. The ossicles
have been separated, but otherwise are in approximately their normal positions
relative to one another.

1 — articular surfaces of malleus and incus 

2 — incus 

3 — footplate of stapes 

4 — handle of malleus 

Right: Photograph of the medial wall of the middle ear, showing stapes,
oval window, and round window. The stapes has been lifted out of the oval
window and the tendon of the stapedius muscle has been cut.

1 — head of stapes 3 — round window 

2 — footplate of stapes 4 — oval window 

(Békésy, 19.)

Neither of the intra-aural muscles is readily visible on examination
of the middle ear, even at surgical operation, since they
lie in bony canals and only their tendons project into the middle-ear
cavity. Békésy suggests that this arrangement is most
fortunate, in that the bony casing prevents the muscles from
vibrating laterally when the ossicles transmit sound-waves.
Such vibration of the muscles would distort the transmitted
sounds.






ROUTES FOR THE TRANSMISSION OF SOUND 
TO THE INNER EAR 

Sound waves entering the external canal may reach the inner
ear by three main routes. The most important is by means of
the ossicular chain across the middle ear from the tympanic
membrane to the oval window. The second of these routes
also involves the tympanic membrane, but transmission across
the middle ear is by means of air waves instead of by movement
of the ossicles. These air waves fall upon the round window
and cause vibration of the round window membrane exactly
as the air waves in the external canal initiate vibrations of the
tympanic membrane. The third avenue of approach does not
involve the tympanic membrane. Sound-energy is taken up
by the walls of the canal and transmitted through the bones of
the skull around the middle ear to the inner ear. In this case
of so-called bone-conduction, the sound need never enter the
external ear, but may be picked up directly by the skull. If the
skull touches a hard vibrating object, this form of conduction
becomes important, and it may be employed to practical advantage
in the event of damage to the mechanism of the middle
ear (see Chapter 11).

It has long been known from clinical experience that restriction
of the movement of the ossicles by adhesions resulting
from old inflammatory processes or by bony union of the stapes
with the margins of the oval window seriously reduces the
efficiency of the middle ear as a transmitting mechanism. Likewise,
a disruption of the ossicular chain may greatly impair the
efficiency of the middle ear, but it has not been possible to study
experimentally in man the effect of simple interruption of the
continuity of the ossicular chain. In animal experiments, however,
where the electrical activity of the cochlea (see Chapter 13)
may be used as an indicator of sound transmission, we may
compare the efficiency of the middle ear before and after disarticulating
the incudostapedial joint and removing a small
piece of the incus. The elevation of threshold resulting from
this operation is about 60 db on the average (Wever and
Bray, 7).
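To convey what an elevation of threshold of this order means physically, the decibel figure may be converted into ratios of sound pressure and of intensity. The short Python sketch below uses only the ordinary definition of the decibel (20 times the common logarithm of a pressure ratio, or 10 times that of an intensity ratio); the function names are illustrative.

```python
def db_to_pressure_ratio(db):
    """Ratio of sound pressures corresponding to a difference of `db` decibels."""
    return 10.0 ** (db / 20.0)


def db_to_intensity_ratio(db):
    """Ratio of intensities (energy flow) corresponding to `db` decibels."""
    return 10.0 ** (db / 10.0)


loss_db = 60.0  # elevation of threshold after interrupting the ossicular chain
pressure_ratio = db_to_pressure_ratio(loss_db)
intensity_ratio = db_to_intensity_ratio(loss_db)

print(f"{loss_db:.0f} db: pressure x {pressure_ratio:,.0f}, "
      f"intensity x {intensity_ratio:,.0f}")
```

A loss of 60 db thus means that the sound pressure must be raised about a thousandfold, and the intensity about a millionfold, to reach threshold.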





It is also known, as a matter of practical experience, that a
rather extensive hole in the eardrum causes only very slight
loss of hearing. Lorente de Nó and Harris used the reflex contraction
of the stapedius muscle as an indicator of sound transmission
in the ears of animals, and showed that it is even possible to
cut entirely around the tympanic membrane and cause a hearing
loss of not more than 20 or 30 db. More surprising, however,
is the observation that some individuals who have lost not
only the tympanic membrane but also malleus and incus may
nevertheless retain hearing that is within 20 or 30 db of normal.
Such cases are exceptional, however. Usually when drum,
malleus, and incus are missing, the sensitivity of the ear is
reduced by 40 to 65 db. Figure 104 shows the loss of sensitivity
of five such ears compared to the average sensitivity of the
normal ears of the same individuals.

Fig. 104. Average hearing loss of five ears without tympanic membrane,
malleus, or incus, referred to the average sensitivity of the normal ears of the
same individuals. Abscissa: frequency. (After Békésy, 25.)

When the drum and one of the major ossicles are missing,
hearing occurs by air-conduction direct to the round window
(Békésy, 25). Then the waves of sound-pressure reach the
sensory mechanism in opposite phase to the waves carried in by
a normal ossicular chain. This difference in phase is understandable
when we recall that waves brought by the ossicles are
delivered to the oval window, whereas airborne waves have
easiest access to the round window. Round and oval windows
communicate with the endolymphatic channels on opposite sides
of the basilar membrane (cf. Figs. 102 and 105), and a wave of
positive pressure causes, in the one case, upward, in the other
case, downward, movement of the basilar membrane. The
proof that such a difference of phase actually occurs consists in
allowing two observers, one with normal ears, the other with
one ear normal and the other ear lacking drum and ossicles, to
listen with the right ear to one source of sound and with the
left ear to another. Tones of slightly different frequency are
then delivered to the two ears. The listeners experience a source
of sound that seems to shift its position as the phase of the sound
waves reaching the two ears varies (cf. Chapter 6). The
normal individual and the subject with one damaged ear localize
the apparent source of sound on opposite sides of the median
plane, showing that the phase of the sound waves has been
reversed in the abnormal ear. When the abnormal ear is then
provided with an artificial membrane and an ossicle consisting
of a fine bristle attached to the artificial membrane and touching
the promontory of the petrous bone, the hearing of that ear is
somewhat improved (cf. also Pohlman, 3), and the phase
relations of the sounds heard by it are restored to normal.

THE SIGNIFICANCE OF THE ROUND WINDOW
FOR HEARING 

The experiments just described show that sound waves can
enter the inner ear by way of the round window. Since they
affect the mechanism of the inner ear in opposite phase to those
transferred by the ossicles and oval window, they must tend to
reduce the effectiveness of the latter. It would seem, therefore,
that the round window is a hindrance to the most effective
working of the normal auditory mechanism. Some support
for this view is to be found in the experiments of Hughson and
Crowe, who utilized the electrical activity of the cochlea and
auditory nerve as a measure of hearing. When they placed
pledgets of cotton on the round window, the electric potentials
increased. A similar effect was produced by placing a periosteal
graft over the round window and testing the animals from
two days to seven weeks later. It is probable that the improvement
of response resulted not so much from "immobilization"
of the round window and prevention of loss of energy from
within the inner ear, as Hughson and Crowe believed, as from
protection of the round window from interfering airborne
sound waves. If the walls of the cavity of the inner ear were
rigid throughout, no movement of fluid in the canals of the
cochlea would be possible, and the basilar membrane would
not vibrate. The round window forms an elastic termination
of the scala tympani, and thereby allows movement of the
endolymph and vibration of the basilar membrane. If, therefore,
the round window membrane is too rigidly fixed, a diminution
in auditory acuity should result. If, however, it is not
rigidly fixed, but partially protected from airborne sound
waves, some improvement of hearing might occur.
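The argument from opposition of phase can be put in rough numerical form. The Python sketch below treats the pressure delivered by the ossicles and the airborne pressure at the round window as two sinusoids of the same frequency, exactly opposite in phase, and finds their resultant. The relative amplitudes used (airborne wave one-tenth of the ossicular, reduced to one-hundredth by a protective covering) are purely hypothetical and are not measurements from any of the experiments cited.

```python
import math


def net_amplitude(ossicular, airborne, phase_difference=math.pi):
    """Resultant of two sinusoidal pressures of one frequency (phasor sum)."""
    squared = (
        ossicular * ossicular
        + airborne * airborne
        + 2.0 * ossicular * airborne * math.cos(phase_difference)
    )
    return math.sqrt(max(0.0, squared))


def change_db(ossicular, airborne):
    """Level of the resultant relative to the ossicular wave alone, in db."""
    return 20.0 * math.log10(net_amplitude(ossicular, airborne) / ossicular)


# Hypothetical relative amplitudes, for illustration only.
unprotected = change_db(1.0, 0.10)   # round window exposed to airborne sound
protected = change_db(1.0, 0.01)     # round window partially shielded

print(f"exposed: {unprotected:.2f} db, shielded: {protected:.2f} db, "
      f"gain from shielding: {protected - unprotected:.2f} db")
```

Whatever the true amplitudes may be, the sketch shows the direction of the effect: so long as the airborne wave is the weaker and opposite in phase, anything that diminishes it raises the resultant.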

THE MECHANICS OF THE OSSICLES 

In animal experiments and in post-mortem studies on human
ears, tiny mirrors have been attached to the tympanic membrane
or to various ossicles (Krarnz, Dahmann, Békésy, 25).
It is thus possible to record photographically the direction and
amplitude of the movements, not only in response to sound but
also in response to contraction of the intra-aural muscles and
to increased atmospheric pressure.

The malleus and incus are so closely bound together that, in
response to moderate pressures, they vibrate as a single unit by
rotating about an axis in the ligament supporting the malleus.
In fact, as long as the amplitude of vibration of the ossicular
chain is so small that the stapes readily can follow it, and as long
as the ossicular joints offer greater resistance than the fixation
of the stapes in the oval window, the whole chain vibrates as
a closed mass. All the ossicles respond to high as well as to low
tones with measurable vibrations. All the ossicles, as might be
expected from their elastic suspension, may move in several
planes, but one principal plane predominates. This plane
corresponds to the in-and-out movement of the handle of the
malleus and the long arm of the incus. The effective lever arm
of the incus is slightly longer than that of the malleus, so that
the amplitude of motion delivered to the stapes is increased in
comparison with that of the handle of the malleus at the center
of the tympanic membrane in the ratio of 4 to 4.8.

The stapes does not move directly in and out like a piston,
but rocks like a bell-crank lever. Examination of the ligament
which attaches it to the border of the oval window shows that
the ligament is thick and tight at the lower posterior pole and
is broad and thin at the upper anterior pole. The lower posterior
pole acts as a fulcrum about which the stapes rotates
(Fig. 105). The movement of the long process of the anvil is
so directed as to produce the bell-crank movement of the stapes
most efficiently. When the intensity becomes very great the
mode of vibration of the stapes alters, so that instead of pivoting
at its posterior pole it rocks about the long axis of the footplate
(Békésy, 25). This change of vibration reduces the resulting
movements of the fluid in the inner ear and serves as a protective
mechanism. The change is made possible by the relative flexibility
of the incudostapedial joint, and the level of stimulation
at which it occurs Békésy identifies with the threshold of pain
and tactile sensations in the ear (see Fig. 19, p. 59).

Fig. 105. Schematic diagram of the tympanic membrane, the ossicles, and
the basilar membrane. The solid figures of the ossicles and the solid lines
for the tympanic, the basilar, and the round window membranes show the
positions of these structures at rest. The open outlines of the ossicles and
the broken lines for the membranes show their positions following inward
displacement of the tympanic membrane by a sound wave. (See Fig. 102 for
names of structures.)

It should be noted that Dahmann's view that incus and
malleus vibrate as a closed system is opposed to the earlier concept
of Helmholtz that there was sufficient movement in the
joint between these two ossicles to allow of the effective delivery
of positive pressure to the stapes, but not of negative pressure.
As a matter of fact, large alternating pressure applied to the
tympanic membrane causes more movement of the malleus outward
than inward. The movement of the incus shows the same
asymmetry but to a lesser degree. The movements of the stapes
are more nearly, but not entirely, symmetrical (see Fig. 106).



Fig. 106. Photographic record, obtained by mirrors attached to the ossicles,
showing the displacements of malleus, incus, and stapes produced by equal
pressures inward and outward applied to the tympanic membrane. The displacements
outward are greater than the corresponding displacements inward,
as shown by the vertical components of the records. (After Dahmann.)

Additional insight into the mechanics of the ossicular chain and
the basis of its nonlinear performance has been obtained from
experimentation upon a carefully constructed model of the
ossicles (Stuhlman). The nonlinearity depends upon the flexible
character of the joint between the malleus and incus. The
performance of the structure is more nearly linear the more
rigidly this joint is locked. If the articulation is loose, the ratio
of movement of malleus to incus is 2 to 1 for inward motion
and 1 to 1 for outward motion of the tympanic membrane. The
motion of the malleus pushes the incus backward, while the
lenticular process at the end of the long crus of the incus passes
through a complex three-dimensional displacement somewhat
resembling the rolling of a pestle in an oval bowl. The displacement
of the footplate on outward motion of the malleus is
twice as great as it is for the same inward motion. The forces
developed under these conditions are summarized in Fig. 107.



Fig. 107. Graph of the force exerted on the stapes as a function of the
degrees of rotation of the malleus. Note the nonlinearity and asymmetry of
the curve as a whole. Point A, at the middle of the central linear portion of
the curve, is the point about which the curve is most nearly symmetrical. It
does not coincide with the position of rest (0).

Inward rotation of 30° dislocates the malleo-incudal joint. The resulting
decrease in pressure on the stapes is indicated by the broken line at the upper
end of the curve. The measurements were made on a scale model of the
ossicles. (After Stuhlman.)

Extreme inward motion of the malleus dislocates the joint be- 
tween malleus and incus, but dislocation does not occur on 
outward motion. This dislocation may well serve as a me- 
chanical protective device against great inward pressures. 

The loose coupling of the malleo-incudal joint is an important
source of asymmetry and nonlinearity in the mechanical
performance of the auditory mechanism, but it should not be
regarded as the sole factor. The various elastic structures, such
as the drum, the ligaments of the ossicles, and the basilar
membrane, may all contribute to provide for the ear a characteristic
like that shown by the curve of Fig. 82 (p. 195).

THE IMPORTANCE OF THE OSSICULAR 
MECHANISM 

The complicated chain of ossicles found in mammals is
not essential for hearing, but it probably does represent an
increase of efficiency over the simpler mechanism in birds. The
bird has, instead of three ossicles, only a single bone, the
columella, which is analogous to the stapes. It is worth noting
that the inner ear of the bird is also more primitive than that
of the mammal. The rods of Corti and the tunnel (cf. Fig.
113) are missing, and simply a group of sensory cells and
supporting cells lie along the basilar membrane (Retzius).

The effect of the tympanic membrane and ossicular chain
in improving the efficiency of hearing is fourfold. First, they
provide for preferential delivery of sound-energy to the oval
window as opposed to the round window. Second, they serve
to collect energy from a relatively large cross-section of air and
deliver it to the much smaller area of the footplate of the stapes.
This, as will appear below, is an important advantage in passing
from air as a conducting medium to a fluid such as the
endolymph. Third, they provide a slight mechanical reduction
in amplitude of motion between the tympanic membrane and
the part of the stapes which is directly in contact with the fluid
of the cochlea. Finally, in conjunction with the intra-aural
muscles, they provide a protective mechanism for the inner ear
against loud low tones without undue impairment of hearing
for faint tones of high frequency (see p. 267).

THE ACOUSTIC IMPEDANCE OF THE EAR 

Any physical system capable of vibration presents a certain
resistance, or, more broadly, an impedance to vibratory energy
delivered to it. This is analogous to the impedance which an
electric circuit presents to the flow of current. The greatest
efficiency in the transfer of energy from one system to another
is attained when the impedances of the two systems are equal.
Air and water differ widely in density and elasticity, and
consequently in acoustic impedance whenever equal cross-sectional
areas are juxtaposed. This principle is clearly recognized by
engineers in the problem of producing and detecting sound
waves in water by submarine signaling devices. In going from
air to water a large cross-section of air should be coupled to a
small cross-section of water, if the device is to be efficient.

The ossicles, as a lever system, connect the tympanic
membrane, whose cross-sectional area is approximately 90 sq. mm,
with the footplate of the stapes, whose area is 3.2 sq. mm. The
full significance of this ratio of 90 to 3.2 is uncertain, because
neither the tympanic membrane nor the stapes moves in and out
as a rigid piston. The tympanic membrane is a flexible
structure fixed at the edges, and the stapes rocks about one end of
its footplate. This rocking of the stapes probably provides a
slight reduction in the effective amplitude of the footplate as
compared to the amplitude of the center of the tympanic
membrane, but the motions of the ossicles are so complex and
uncertain that the amount of the reduction has not been determined
with exactitude. It is probably of the order of 2 to 1. This
reduction, plus the difference in area between the tympanic
membrane and the footplate of the stapes, aids in matching the
impedance of the inner ear to that of air. Whether this match
is exact or not we do not know, for we have not determined the
exact impedance of the inner ear. The acoustic impedance of the
ear as a whole has been measured, however, and the comparison
of its value with that of air gives an indication of the efficiency
of sound-detection by the ear, irrespective of how the matching
is brought about. Troger devised a method for measurement
of the acoustic impedance of the ear, which depends upon the
production of standing waves in a tube system whose end is
closed by the tympanic membrane. He deduced, from his
measurements, that the membrane is to be regarded as a
complex impedance, having an elastic character, which is very great
for low frequencies, but which reaches a minimum at 800 cycles
(Fig. 108). At this frequency, the efficiency of energy transfer
must be high, since air and ear show the same numerical value
for their impedances. Measurements of the ears of various
observers give good agreement up to about 600 cycles. Above
this value, each individual shows various points of resonance,
and one subject may differ considerably from another,
particularly near the points of resonance.
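
The size of the mismatch between air and a watery fluid, and the kind of
correction a large drum coupled to a small footplate could supply, may be
illustrated by a rough calculation. The Python sketch below is illustrative
only. It assumes plane waves at normal incidence, takes the value of 40
c.g.s. units for air from the text, assumes a round figure of 150,000 c.g.s.
units for water (a value not given in the text), and treats drum and
footplate as rigid pistons joined by an ideal lever, which, as noted above,
they are not. The function names are hypothetical.

```python
import math

# Specific acoustic impedances in c.g.s. units.
Z_AIR = 40.0       # value quoted in the text for air
Z_WATER = 1.5e5    # ASSUMED round figure for water; not given in the text


def transmitted_fraction(z1, z2):
    """Fraction of incident energy crossing a plane boundary at normal incidence."""
    return 4.0 * z1 * z2 / (z1 + z2) ** 2


def ideal_pressure_gain(area_in, area_out, amplitude_reduction):
    """Pressure gain of an idealized rigid-piston transformer.

    The same force on a smaller area raises the pressure by area_in / area_out;
    an ideal lever that reduces amplitude by some factor raises the force
    by that same factor.
    """
    return (area_in / area_out) * amplitude_reduction


def to_db(ratio, is_power):
    """Express a ratio in decibels (10 log10 for power, 20 log10 for pressure)."""
    return (10.0 if is_power else 20.0) * math.log10(ratio)


# Energy passing directly from air into water, with no transformer at all.
direct = transmitted_fraction(Z_AIR, Z_WATER)

# Areas in sq. mm and the 2-to-1 amplitude reduction are the text's figures.
gain = ideal_pressure_gain(90.0, 3.2, 2.0)

print(f"air to water, no transformer: {direct:.4%} of the energy "
      f"({to_db(direct, True):.1f} db)")
print(f"idealized pressure gain of drum and ossicles: {gain:.1f} "
      f"({to_db(gain, False):.1f} db)")
```

Under these assumptions only about one part in a thousand of the energy
would cross an unaided air-water boundary, whereas equal impedances give
complete transfer. The computed gain is an upper bound of the idealized
model, not a measured property of the ear.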

A close relationship between the impedance of the ear and
the threshold of sensitivity was found by Geffcken, who further
showed that certain objections raised by Wien against the
resonance theory of hearing are met by recognition of the part
played by variations in the impedance of the tympanic
membrane with frequency. It is also evident that the sharp minor
maxima and minima found in individual threshold curves
may well depend upon points of mechanical resonance, since
the mechanical vibrating system of drum, ossicles, and inner ear
is obviously very complex. Each part may have a resonant
point of its own which appears as a maximum or a minimum
on any over-all curve, whether for threshold or for impedance,
which expresses the performance of the ear as a whole.

Fig. 108. Impedance of the human ear. The units are in the
centimeter-gram-second system. On this scale the impedance of air is 40,
as shown by the broken line. (After Troger.)

At very low frequencies (below 100 cycles) the impedance
of the ear is determined not entirely by the mechanical
properties of drum, ossicles, and inner ear, but is modified
significantly by the air in the cavity of the middle ear itself, which
acts as a cushion and tends to diminish slow excursions of
large amplitude. Measurement of the acoustic impedance of
the ear at a frequency of 5 cycles (Békésy, 24 and 25) shows
that it is equivalent to a closed chamber approximately 2.0 cc
in volume. This value agrees closely with the estimates of
capacity based on purely anatomical studies (see p. 249). The
normal air-cushion may act as a protective mechanism against
sudden extreme changes in pressure or very loud sounds of low
frequency. If, however, the membrane is perforated by a hole
1 mm square, the cushioning effect of the air in the middle ear
is lost. When the middle ear is opened through the temporal
bone without damaging the tympanic membrane, the
impedance of the ear at 10 cycles corresponds to a volume greater than
8.0 cc. The mastoid air cells communicating with the middle
ear cavity differ considerably from one individual to another
and are often small enough to absorb sound and change the
resonance of the middle ear.
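
The statement that the ear at very low frequencies is "equivalent to a
closed chamber" of a certain volume can be made concrete. For slow
vibrations a small closed volume of air behaves as a pure stiffness, and
the magnitude of its acoustic impedance is the product of the ratio of
specific heats and the atmospheric pressure, divided by the product of the
angular frequency and the volume. The Python sketch below applies this
standard relation in both directions. It assumes adiabatic compression and
ordinary atmospheric pressure; the constants and function names are
assumptions introduced for illustration and are not taken from the
measurements cited.

```python
import math

GAMMA = 1.4      # ratio of specific heats of air (adiabatic compression assumed)
P0 = 1.013e6     # assumed atmospheric pressure, dynes per sq cm


def chamber_impedance(volume_cc, freq_cycles):
    """Magnitude of the acoustic impedance of a small closed air chamber.

    The result is in c.g.s. acoustic ohms (dyne sec per cm^5).
    """
    omega = 2.0 * math.pi * freq_cycles
    return GAMMA * P0 / (omega * volume_cc)


def equivalent_volume(impedance, freq_cycles):
    """Volume in cc of the closed chamber that would present this impedance."""
    omega = 2.0 * math.pi * freq_cycles
    return GAMMA * P0 / (omega * impedance)


# Intact middle ear: 2.0 cc at 5 cycles (figures from the text).
z_closed = chamber_impedance(2.0, 5.0)

# Opened middle ear: 8.0 cc is the text's lower bound on the volume at
# 10 cycles, so this impedance is an upper bound.
z_opened = chamber_impedance(8.0, 10.0)

print(f"2.0 cc at 5 cycles : {z_closed:,.0f} acoustic ohms")
print(f"8.0 cc at 10 cycles: {z_opened:,.0f} acoustic ohms")
print(f"volume recovered from the first value: "
      f"{equivalent_volume(z_closed, 5.0):.2f} cc")
```

At a fixed frequency the impedance varies inversely with the volume, which
is why opening the middle ear to a larger air space lowers the stiffness
presented to the drum.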

THE NATURAL PERIOD OF THE EAR 
Several direct measurements have been made of the natural
period of vibration of the ear in response to sudden brief
disturbances such as the sound of an electric spark. Figure 109
shows two records of the damped free oscillations of the malleus
following stimulation by a sudden sound. A hole was drilled
through the temporal bone of a human corpse and a tiny mirror
attached to the handle of the malleus. The ear was then
stimulated by the sound of an electric spark. The resulting
patterns of vibration differed as shown in pictures A and B of
Fig. 109, depending upon the form of the sound wave. Figure
B illustrates with particular clarity the natural period of
vibration of the structures of the middle ear and also the rapid
damping of these vibrations. Frank and Broemser originally gave,
as the natural frequency of the ear, the figure of 800 to 1500
cycles, which is confirmed by Békésy (25, 15). Kobrak’s
figures, 550 to 800 cycles, are a little lower, whereas Davis,
Derbyshire, Lurie, and Saul give 1200 to 1700 cycles for the cat. All
agree that the over-all vibration of the ear is highly, but not
critically, damped. Kobrak gives a damping factor equal to
about one half the value for critical damping. Under this
damping the auditory mechanism executes a few rapidly
damped oscillations at its natural period following sudden
displacement from its position at rest and also following the sudden
onset or cessation of strong stimulation at any frequency. This
general ‘off-effect’ is not to be confused with the Helmholtzian
idea that specific portions of the basilar membrane continue to
vibrate for a few cycles at the same frequency as the previous
stimulating tone. The basilar membrane, although differentially
sensitive in various regions to different frequencies, appears to
be essentially critically damped, for the electrical activity of the
cochlea shows, as an ‘off-effect,’ only a nonspecific natural
period which probably depends upon the vibrating structures of
the middle ear (see Appendix II).
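
What "a few rapidly damped oscillations" implies numerically can be seen
by treating the middle ear, very crudely, as a single mass on a spring
with viscous damping. The Python sketch below is a minimal illustration
under that assumption. The damping ratio of 0.5 corresponds to Kobrak's
"about one half the value for critical damping"; the natural frequency of
1000 cycles is an assumed round figure within the range quoted above, and
the function names are hypothetical.

```python
import math


def damped_frequency(natural_freq, damping_ratio):
    """Frequency of free oscillation of an underdamped second-order system."""
    if not 0.0 <= damping_ratio < 1.0:
        raise ValueError("the system oscillates only for a damping ratio below 1")
    return natural_freq * math.sqrt(1.0 - damping_ratio ** 2)


def peak_ratio(damping_ratio):
    """Ratio of each peak to the preceding peak on the same side of rest."""
    if not 0.0 < damping_ratio < 1.0:
        raise ValueError("damping ratio must lie between 0 and 1")
    return math.exp(-2.0 * math.pi * damping_ratio
                    / math.sqrt(1.0 - damping_ratio ** 2))


def cycles_to_decay(damping_ratio, fraction):
    """Whole cycles needed for the peak amplitude to fall below a fraction."""
    return math.ceil(math.log(fraction) / math.log(peak_ratio(damping_ratio)))


ZETA = 0.5     # one half of critical damping (Kobrak, as cited in the text)
F0 = 1000.0    # ASSUMED natural frequency in cycles, within the quoted range

print(f"frequency of the free oscillation: {damped_frequency(F0, ZETA):.0f} cycles")
print(f"each peak is {peak_ratio(ZETA):.1%} of the one before it")
print(f"cycles needed to fall below 1 per cent: {cycles_to_decay(ZETA, 0.01)}")
```

With damping at half the critical value, each swing is only a few per cent
of the one before it, so the free vibration is effectively over within a
couple of cycles. A critically damped system, by contrast, returns to rest
without oscillating at all.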

The damping factor, and also the natural period of the
structures of the middle ear, are altered by contraction of
the tensor tympani and the stapedius muscles (Dahmann). The
transmission factor is reduced by 30 per cent and the elastic
component of the impedance is increased during voluntary
contraction of these muscles (Geffcken). The effect upon the sensitivity
of the ear to various frequencies will be considered below, but
all the changes seem to be those which we should expect on
physical principles from an increase in the tension of an elastic
vibrating structure.


Fig. 109. Vibration of the ossicular chain in response to single
clicks, recorded by means of a mirror attached to the malleus.
(After Békésy, 25.) Note the difference in time scales of A and B.

A: response to a sharp click.

B: response to a dull click, showing natural period and rate of
decay of vibration with special clarity.


THE ACTIVITY OF THE INTRA AURAL MUSCLES 

Contraction of the muscles of the middle ear has been
studied directly in animals by attaching mirrors to the tympanic
membrane or ossicles or by attaching a delicate myograph
directly to the tendons. More recently the electrical activity of
the cochlea has been employed in order to observe the effect of
contraction upon the transmission of sounds. Luscher has
observed directly the movements of the stapedius in a human
subject with a defective tympanic membrane. Occasionally it
is possible to appreciate the contractions of one’s own ear
muscles by listening to the faint sounds which may be produced
by the movements of the ossicles or by crepitation of wax on the
external surface of the membrane. Ordinarily, these
movements are silent, or very nearly so, but with practice it is often
possible to apprehend them without much difficulty and also
to learn to appreciate the direct sensation of stretching or
movement associated with the contraction. A few individuals are
able to contract their intra-aural muscles voluntarily.

From the combined results of human and animal
observations the muscles of the middle ear appear to contract reflexly
in response to irritation of the external canal, of the pinna, or
of a considerable area of skin surrounding the external ear.
Light stroking or tickling may be enough to elicit this response
in man, and in the rabbit, under light anesthesia, stimulation
of the cutaneous auricular nerves gives excellent reflex
responses. In the rabbit the reflex responses are essentially
bilateral, whatever the source of stimulation (Lorente de Nó, 1).
This is probably not generally true for man. One of the present
writers (H. D.) finds that the responses of his own intra-aural
muscles to light cutaneous stimulation are essentially
homolateral. Contraction of the muscles in question seems to occur
regularly as part of the act of yawning. Whether this
contraction is to be regarded as primarily included in the pattern of
yawning or whether it is secondary to opening of the Eustachian
tube is still uncertain.

A definite threshold for reflex response of the muscles to
sound can be established. The threshold intensity in rabbits,
under urethane anesthesia, is some 40 db above the threshold
for human hearing (Lorente de Nó, 1). As a function of
frequency, the threshold for reflex response parallels rather well
the human audibility curve, although it is relatively lower for
very high tones, above 8000 cycles. The strength of the reflex
contraction also varies directly with the intensity of the
stimulating sound. Furthermore, the contraction tends to persist as
long as the stimulating sound continues. It also appears that
the stapedius reflex is the more sensitive to tones below 3000
cycles, whereas above this frequency the responses of stapedius
and of tensor tympani have approximately the same threshold.
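
For readers who wish to translate the figure of 40 db into physical terms,
the short Python sketch below converts a level difference in decibels into
the corresponding ratios of sound pressure and of intensity. It uses only
the ordinary definition of the decibel; the function names are
hypothetical.

```python
def pressure_ratio(level_db):
    """Ratio of sound pressures corresponding to a level difference in db."""
    return 10.0 ** (level_db / 20.0)


def intensity_ratio(level_db):
    """Ratio of sound intensities corresponding to a level difference in db."""
    return 10.0 ** (level_db / 10.0)


REFLEX_LEVEL_DB = 40.0   # reflex threshold above the threshold of hearing (from the text)

print(f"pressure ratio : {pressure_ratio(REFLEX_LEVEL_DB):,.0f}")
print(f"intensity ratio: {intensity_ratio(REFLEX_LEVEL_DB):,.0f}")
```

A sound that just elicits the reflex thus has roughly a hundred times the
pressure amplitude of one that is barely audible.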

The latency of the contractions of these muscles to the sudden
onset of a tone is 14 to 16 msec. Maximal tension is attained
in 100 to 150 msec. These times are brief, and it is evident that
the speed of action of the tensor tympani and the stapedius
corresponds to the protective reflexes of the limb muscles and
is not much slower than the blinking of the eyelids. In many
ways, the action of these muscles should be compared with that
of the eyelid and its musculature, rather than with that of the
muscles of the iris of the eye which govern the size of the pupil,
for the latter are slowly acting smooth muscles. However, like the
muscles of the iris, the muscles of the middle ear also perform
a protective and adjusting function in relation to their sense
organ.

The acoustic reflex of the middle ear closely resembles the
spinal reflexes of skeletal musculature generally, and, in
common with them, is abolished by deep anesthesia (Hallpike, 2).
It is probably for this reason that it has not been seen by
more investigators of auditory function. It should be noted
that any observation of the relation between the electrical
phenomena of the cochlea and the intensity of stimulation (see
Chapter 14) which is undertaken while this reflex is active may
be modified by the effect of the reflex itself upon the
transmission of sound across the middle ear (Hallpike and
Rawdon-Smith, 1). The effect of the reflex upon transmission may
account for certain differences between the conclusions drawn
from studies of intact animals and man and from studies of the
ear in deeply anesthetized animals.





THE ANATOMY OF THE INNER EAR 

The anatomy of the inner ear is described in the various
standard textbooks of anatomy and, apart from details
concerning the numbers of ganglion cells and the mode of innervation
of the hair-cells, little has been added to our knowledge of this
subject in recent years. It will be convenient, however, to
review the arrangements of the essential structures and to define
the various anatomical terms which will of necessity be
employed in describing the behavior of the sense-organ in response
to sound-waves impinging upon it.

In the temporal bone of the skull lies the internal ear with
its sense-organ of hearing. It is called the labyrinth, from the
complexity of its shape, and consists of two parts: the osseous
labyrinth, a series of cavities within the petrous part of the
temporal bone, and the membranous labyrinth, a series of
communicating sacs and ducts contained within the bony cavities.
The osseous labyrinth consists of three parts: the vestibule, the
semicircular canals, and the cochlea (Fig. 111). They contain
a clear fluid, the perilymph, in which the membranous labyrinth
is situated.

Fig. 111. Lateral view of the left osseous labyrinth. The figure represents
a cast of the spaces and channels within the temporal bone. The membranous
labyrinth lies within these spaces.

C: cochlea
V: vestibule
SC: semicircular canals
OW: oval window
RW: round window





The vestibule is the central part of the osseous labyrinth, and
is situated just medial to the tympanic cavity of the middle ear.
It measures about 5 mm from front to back, the same from top
to bottom, and about 3 mm across. In its lateral wall is the
oval window, into which the footplate of the stapes is attached
by its annular ligament. The three semicircular canals,
superior, posterior, and lateral, open into the vestibule. They need
not be considered in detail, as they are not concerned with the
function of hearing. They do, nevertheless, form part of the
total chamber to which changes of pressure, generated by
movement of the footplate of the stapes, are delivered.

The bony cochlea, which is the part of the inner ear
concerned with the reception of sound, lies horizontally in front of
the vestibule. In shape it resembles a snail shell. It measures
some 5 mm from base to apex, and its breadth across the base
is about 9 mm. It consists of a conical central axis, the
modiolus, and a canal, the inner wall of which is formed by the
central axis, wound spirally around it for 2¾ turns. A delicate
shelf of bone, the osseous spiral lamina, projects from the
modiolus and partially subdivides the canal into two parts
throughout its length. A tough membrane, the basilar
membrane, stretches from the free border of the lamina to the outer
wall of the bony cochlea, and completes the separation of the
canal into two passages, except for a small communicating
opening between them, the helicotrema, at the apex of the modiolus.
The cochlear division of the eighth cranial (auditory) nerve
enters the modiolus at its base, as shown in Fig. 112. The nerve
cells are grouped as a long spiral ganglion (Corti’s ganglion)
opposite the osseous spiral lamina. The terminal filaments of
the nerve emerge through small openings in the bony structure.
The canal of the cochlea has three openings. One, the
round window, or fenestra rotunda, looks into the cavity
of the middle ear but is closed by the round window membrane.
Another, an elliptical opening, leads into the vestibule. The third,
a much smaller opening, is the end of the cochlear aqueduct,
a tiny canal leading through the temporal bone to the
subarachnoid cavity at the base of the brain.







does not form a single chamber, but consists of two sacs, the
utricle and the saccule, containing sensory epithelium and
supplied by nerve fibers from the vestibular portion of the eighth
cranial nerve. Neither utricle nor saccule is concerned with the
function of hearing, although the suggestion has repeatedly been
made that the saccule may play a part in the detection of
vibrations or the hearing of low tones.

The sensory cells concerned with hearing are contained in
the ductus cochlearis, a portion of the membranous labyrinth
which is arranged as a spiral tube in the bony canal of the
cochlea and lies along its outer wall on the basilar membrane.
The basilar membrane forms the floor of the ductus cochlearis,
and a second, much more delicate membrane (Reissner’s
membrane) extends diagonally from the osseous spiral lamina to the
outer wall of the cochlea some distance above the outer edge of
the basilar membrane. The ductus cochlearis, which is also
termed the scala media, ends as a blind sac at the helicotrema.
The portion of the canal of the cochlea above the scala media is
the scala vestibuli, and the portion below the basilar membrane,
the scala tympani (see Fig. 113).

The basilar membrane is a stout tendinous layer of closely
adjacent fibers. It has generally been assumed that these fibers
are under tension, but no direct evidence of such a static tension
is available. The greatly thickened periosteum which forms
the outer wall of the ductus cochlearis is called the spiral
ligament. Its lower portion forms the attachment of the outer
edge of the basilar membrane. The under surface of the basilar
membrane is covered by vascular connective tissue. One artery,
somewhat larger than the rest, running lengthwise with the
basilar membrane, is termed the vas spirale. The blood supply
of the cochlea is provided by the internal auditory artery, a
branch of the basilar artery, which accompanies the auditory
nerve through the internal auditory meatus from within the
cranium.

The organ of Corti is a series of epithelial structures arranged
along the inner edge of the basilar membrane (Fig. 113). A
tunnel, which is composed of two rows of rods, the inner and
outer pillars, or rods of Corti, forming a triangle with the basilar
membrane beneath them, divides the organ of Corti into an
inner and outer portion. The inner rods of Corti stand at the
attachment of the basilar membrane to the spiral lamina. The
nerve fibers which innervate the outer portion of the organ of
Corti pass across the tunnel. On the inner side of the inner rods
is a single row of hair-cells, the inner hair-cells, and on the outer
sides of the outer rods are three or four rows of smaller external
hair-cells together with various supporting cells. The terminal
fibers of the acoustic nerve end in contact with these hair-cells,
which are the ultimate sensory cells of the organ of hearing.
Their name is derived from the cilia, or tiny hairs, which
project from their upper ends into the endolymph of the ductus
cochlearis.

Fig. 113. Photomicrograph of the organ of Corti from the first turn of the
cochlea of a guinea pig. The human organ of Corti is closely similar. The
bending of the rods of Corti and of the basilar membrane is a fixation artefact.
(Lurie, 1, 2.)

1: scala vestibuli
2: tectorial membrane
3: Reissner’s membrane
4: external hair-cells
5: supporting cells of Deiters
6: scala media
7: Hensen’s cells
8: spiral ligament
9: basilar membrane
10: rod of Corti
11: tunnel
12: scala tympani
13: internal hair-cell

Above the organ of Corti is a semisolid structure consisting
of fine colorless fibers imbedded in a transparent matrix. This
tectorial membrane is attached at its inner edge to the superior
lip of the osseous lamina near the attachment of Reissner’s
membrane. The tectorial membrane varies considerably in
different microscopic preparations, and it is still an open question
whether it pre-exists in the living state in the form in which it
is seen after fixation and preparation for study under the
microscope. There is some reason to believe that it may represent a
post-mortem coagulation or condensation of a much more
diffuse colloidal structure or material in the ductus cochlearis
(Bowen).

The resonance theory of hearing postulates that different
portions of the basilar membrane vibrate selectively in response
to different frequencies. With this possibility in view it is of
some interest to consider the variations in dimensions of the
structures of the cochlea as a function of their distance from
the oval window (see p. 277).

The cross-section of the canal of the cochlea becomes smaller
as we approach the helicotrema, but the change is somewhat
irregular. The basilar membrane, on the other hand, is
narrowest at the end near the round window and vestibule and
becomes progressively and systematically wider toward the
helicotrema. Figure 114 presents these facts in diagrammatic
form with actual measurements. The tunnel of the organ of
Corti also becomes progressively larger. The inner and outer
rods near the round window are about 50 microns in length,
whereas near the helicotrema they are approximately 85 and
100 microns, respectively. The span of the arch increases from
20 to 85 microns. A group of cells situated to the outer side
of the external group of hair-cells are known as Hensen’s cells
(Fig. 113), and contain fat globules. The fat globules are
absent in the basal coils (at least in adult guinea pigs), appear
halfway around the second coil, and show a finely graded
increase toward the apex, where their bulk is considerable
(Hallpike, 1). The same arrangement may reasonably be inferred
for the corresponding cells in the human organ of Corti, since
in other details the microscopic structures of guinea pig and
man bear a close resemblance to one another. The spiral
ligament decreases in size from the vestibule to the helicotrema,
whereas the tectorial membrane increases progressively in size
from basal to apical end of the cochlear canal. These
progressive, systematic changes in dimensions are presumably
important in the dynamics of the cochlea.

Fig. 114. Diagram showing the dimensions of the basilar membrane and of
the canals of the human cochlea. (Fletcher, based on measurements from
Wrightson and Keith. Courtesy of D. Van Nostrand Company, Inc.)

There are approximately 3500 hair-cells in the inner row
and about 20,000 divided among the three outer rows. The
inner hair-cells are slightly larger (12 microns) in diameter
than the outer hair-cells (8 microns), and the dimensions of
each type are constant along the basilar membrane. Both the
internal and the external hair-cells are quite evenly spaced along
the basilar membrane.

There are between 25,000 and 29,000 ganglion cells in the
spiral ganglion of Corti within the modiolus. They are not
evenly distributed along the length of the basilar membrane,
being more densely congregated in the upper portion of the
basal turn and fewest in the upper middle and apical sections.
The average numbers per millimeter are: lower basal, 934;
upper basal, 1076; lower middle, 971; upper middle and apical,
502 (Guild, 2).

Each inner hair-cell is innervated by one or two nerve-fibers,
and each nerve-fiber makes connections with one or two
hair-cells (Lorente de Nó, 2). The external hair-cells, however,
have multiple innervation. A nerve-fiber may connect with
many external hair-cells, extending over a range of as much as
one-half of a turn, and each hair-cell may be connected with
several nerve-fibers. The nerve-fibers to the outer rows of cells
pass out radially from the spiral ganglion, cross the tunnel of
the organ of Corti, and, upon arriving in the outer rows, turn
sharply and pass down toward the basal end of the cochlea.
Figure 115 illustrates this mode of innervation of the hair-cells.

Fig. 115. Diagram of the innervation of the organ of Corti. (After
Lorente de Nó, 2.)

O.C.: organ of Corti
E.H.C.: external hair-cells
E.S.F.: external spiral fibers, each innervating many external hair-cells
I.S.F.: internal spiral fibers, of unknown function
I.H.C.: internal hair-cells
R.F.: radial fibers, innervating the internal hair-cells
G.C.: ganglion of Corti. Arrows show the direction of the fibers
away from their cell bodies.
C.F.: centrifugal fibers, of unknown function
A.N.: auditory nerve





The anatomical arrangement of the nerve fibers in the auditory
nerve and their central connections will be considered separately
in Chapters 16 and 18.

DYNAMICS OF THE INNER EAR 

We have considered the anatomy of the inner ear as it
appears under the microscope after death. The problem of
how the ear, as a physical system, reacts to pressure waves
transmitted to it by way of the footplate of the stapes has not yet been
solved in all details. The small size and relative inaccessibility
of the cochlea make direct observation difficult. On the other
hand, we can apply, with some confidence, certain physical
principles, and thereby discover the probable mode of vibration
of the basilar membrane and what pattern of nerve impulses the
vibration will set up in the auditory nerve.

The bony labyrinth is a practically closed chamber with rigid
walls, except for the oval and round windows. The fluid
within it is incompressible, and therefore, when the footplate
of the stapes vibrates, significant mass movements of the fluid
within the labyrinth can occur only by virtue of the yielding of
the round window membrane. It is evident from Fig. 102 that
slow inward movement of the stapes can cause a flow of
perilymph up the scala vestibuli, through the helicotrema, and
down the scala tympani to the round window. Figure 105
illustrates an alternative pathway. When the movement of
fluid up the scala vestibuli is rapid it will be opposed by the
frictional resistance to flow in the narrow scala and by the inertia
of the fluid column in the apical turns. This process will
generate pressure in the endolymph and hence on Reissner’s
membrane (which we may assume to be practically flaccid and
unresisting) and on the basilar membrane beneath it. The basilar
membrane will bulge into the scala tympani, as shown in Fig.
105, and displace the perilymph within it toward the round
window. The more rapid the movement, the closer to the round
window is the bulge in the basilar membrane.

We shall be able completely to account for the dynamic
behavior of the cochlea in response to an acoustic disturbance
only when we shall have succeeded in setting up and solving the
differential equations describing the motions of the various parts
of the cochlear system and in evaluating the several constants
involved. An attempt at such a set of equations is presented
in Appendix II. Here, however, we may consider the nature
of certain principles which make it reasonable to suppose that
the basilar membrane responds at different places to different
frequencies.

Two principal factors operate to place the disturbance of the
basilar membrane closer to the round window when the
frequency of stimulation is raised. (1) The basilar membrane is
broadest near the helicotrema and narrowest near the round
window. If the fibers of the membrane are under tension, their
varying elasticity will make the longer fibers more susceptible
to movement by low frequencies. The strings of a piano
exhibit a crudely analogous effect. The shorter fibers will be
moved most easily by high tones. (2) The mass of the total
amount of fluid moved will be smaller when the disturbance
is nearer to the round window. Now, in a mechanical system,
the natural frequency is higher, the smaller the mass of the
system. Consequently, high frequencies will tend to activate
the ear in such a way as to move a small mass. Just what mass
will be most readily displaced by a given frequency will depend
upon the additional factors of the stiffness and resistance
involved, but the general principle is clear: less cochlear fluid
will be set in motion by a high than by a low frequency.
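
The rule that the natural frequency is higher, the smaller the mass, is
that of the simple mass-and-spring system, whose natural frequency is the
square root of stiffness divided by mass, divided by 2π. The Python sketch
below tabulates this relation for a fixed stiffness. The numerical values
are hypothetical and chosen only to show the trend; they are not estimates
for the cochlea, whose behavior, as the text makes clear, also depends on
resistance and on stiffness that varies from place to place.

```python
import math


def natural_frequency(stiffness, mass):
    """Natural frequency (cycles per sec) of an undamped mass on a spring."""
    return math.sqrt(stiffness / mass) / (2.0 * math.pi)


STIFFNESS = 1.0e6   # HYPOTHETICAL stiffness, dynes per cm

# Hypothetical masses in grams, from heavier to lighter.
for mass in (4.0, 1.0, 0.25):
    freq = natural_frequency(STIFFNESS, mass)
    print(f"mass {mass:5.2f} g -> natural frequency {freq:7.1f} cycles")
```

Reducing the mass to one quarter doubles the natural frequency, since the
frequency varies inversely as the square root of the mass.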

Roaf sums up the dynamics of the cochlea as follows: “It is
evident that the impedance due to the mass and friction of the
perilymph will tend to produce greater differences of pressure
at the narrower end of the basilar membrane with rapid changes,
whilst slower changes will cause lesser differences, so that the
basilar membrane will be moved at a wider part.”

According to these principles, aided presumably by minor
factors such as variations in the physical constants of the organ
of Corti, the basilar membrane will vibrate selectively at one
part or another according to the frequency of the sound. This
selective vibration is the fundamental physical basis of the
analysis of sound by the cochlea. Direct evidence of such
‘tuning’ of the cochlea and the locations of the regions of maximal
sensitivity to particular frequencies will be presented in Chapter
15.

Evidence that a change in the physical constants of the inner
ear alters the perceived pitch of a tone is presented by Bekesy
(4). He observed that when the veins of the neck are
compressed, so that the veins and capillaries of the head become
engorged with blood, the pitch of a tone may diminish by an
amount corresponding to a reduction of 2 per cent in frequency.
The effect is most evident with low tones of moderate
intensity. Distention of the small veins and capillaries presumably
alters the stiffness and mass of the vibrating structures of the
inner ear sufficiently to modify the pattern of vibration of the
basilar membrane.
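
To give a feeling for the size of a 2 per cent change, the shift
may be expressed as a musical interval. The sketch below uses
the cent (one hundredth of an equal-tempered semitone), a unit
not employed in the original account; the function name is
hypothetical.

```python
import math

def frequency_change_in_cents(ratio):
    """Convert a frequency ratio to a musical interval in cents.

    1200 cents make one octave; 100 cents make one semitone.
    """
    return 1200.0 * math.log2(ratio)

# A reduction of 2 per cent corresponds to a frequency ratio of 0.98.
shift = frequency_change_in_cents(0.98)
print(f"2 per cent reduction = {shift:.1f} cents")
```

The result is a lowering of roughly 35 cents, or about a third
of a semitone.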

It should be emphasized here that the preceding discussion
should not be interpreted as implying a simple resonance theory
of hearing. True, the cochlea behaves as if the basilar
membrane were composed of a row of tuned resonators, in the sense
that a maximum of vibration occurs at a given place for a given
frequency, but the physical principles by which this apparent
tuning is achieved are far more complex than those involved
in simple resonant systems. For one thing, the apparently
highly damped state of the cochlea rules out the simple picture
of a row of resonators and requires that we invoke alternative
concepts, such as those suggested in Appendix II.

TRAVELING WAVES ON THE BASILAR MEMBRANE 

The brief outline of the dynamics of the cochlea presented
above is based upon inference rather than upon direct
observation. It has not yet proved technically possible to observe the
basilar membrane in vibration with sufficient accuracy to
determine directly its pattern of vibration either in response to
steady tones or to sudden impulses. In principle, such
observations might be carried out and the pattern of vibration
determined if the basilar membrane could be viewed under
stroboscopic illumination. The efforts which have been made do little
more than demonstrate the difficulty of the undertaking. The
study of large-scale physical models, which attempt to reproduce
the conditions in the cochlea, is open to the fundamental
objection that assumptions must always be made as to the
correspondence between the physical constants of the model and
those of the ear itself. Obviously, until the actual physical
constants of the ear are better known than they are at present, the
conclusions drawn from the study of models are no better than
the assumptions which enter into their construction.

In spite of this difficulty, certain observations by Bekesy (15,
2) upon a model of the inner ear are of great interest, for they
indicate certain features of the dynamics of such a system which
are not directly evident from the analysis previously presented.
They show that when the stapes is suddenly displaced all the
basilar membrane does not move simultaneously, as we might
expect, but a wave sweeps progressively along it, like the wave
which travels along a slack rope that has been given a sudden
shake.

Fig. 116. Diagram of a traveling wave set up on the basilar membrane by
outward movement of the stapes, which lies at one end of the diagram with
the helicotrema at the other. (After Bekesy, 15.)

Bekesy reports that under stroboscopic illumination it can
be seen that a momentary aperiodic displacement of the stapes
causes the fluid in the neighborhood of the stapes to move with
it like a piston, so that up to the region where an eddy is
generated when a strong steady tone of 1000 cycles is employed, the
basilar membrane momentarily bulges up, while the region in
the neighborhood of the helicotrema remains quiet. This
behavior is illustrated by the heavy line in Fig. 116. Shortly
thereafter the region in the neighborhood of the stapes swings
back aperiodically to its position of rest, while, at the other end,
as shown by dotted lines in the figure, a flat traveling wave is
generated which spreads completely up to the helicotrema. The
sudden bulging up of the basilar membrane throughout the
region up to that which normally responds maximally to 1000
cycles indicates that the velocity of propagation of the traveling
wave in the neighborhood of the stapes exceeds that in the
more distant region.

If one allows the stapes of the model to execute a single
vibration, such as would result from a momentary alteration of
pressure in the auditory meatus, the first half of the membrane
follows exactly the vibration of the stapes, while in the
neighborhood of the helicotrema two or three wave peaks,
corresponding to highly damped traveling waves, may be seen.

THE VELOCITY OF THE TRAVELING WAVES 

Bekesy does not hesitate to carry over to the human ear the
conclusions drawn from the observation of his model, nor are
we entirely without evidence to justify his confidence. The
notion of a form of vibration of the basilar membrane which
consists of a progressive series of traveling waves running along
the membrane enables us to account for the temporal dispersion
of the nerve impulses in the auditory nerve which are set up by
stimulation by a sudden sound. The pattern of the nerve
impulses is considered in Chapter 16, but, because of the
apparently intimate relation between their temporal dispersion and
the mode of vibration of the basilar membrane, certain features
of neural activity must be examined at this point.

When the ear is stimulated by an abrupt sound wave, such
as a click, the resulting nerve impulses do not all appear
simultaneously with one another in the auditory nerve. Some appear
after considerable delay. The earliest are those generated near
the middle of the basilar membrane. They are the impulses
which are masked most effectively by tones between 2000 and
2500 cycles. The difference in latency between impulses
generated at this point and those initiated near the helicotrema is
as much as 1.5 to 2.0 msec. If we take 30 mm as the length of
the basilar membrane and the difference in latency between the
two groups of impulses as 1.5 msec, the speed of propagation of
the traveling wave along the upper half of the cochlea turns out
to be approximately 10 meters per second. This figure is, of
course, only an approximation, but it indicates the order of
magnitude involved. Furthermore, it is almost certain that
the speed of propagation over the basal half of the cochlea is
considerably greater than this, since the earliest group of
impulses generated near the middle of the basilar membrane
appears after a latency which may be as brief as 0.7 msec. This
fact indicates a velocity of the traveling wave of at least 20 to
30 meters per second over this first portion of the basilar
membrane. In all probability, there is a gradient of velocity from
one end of the membrane to the other.
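
The arithmetic behind these two estimates can be written out
briefly. The sketch below simply divides half the assumed
30-mm length of the membrane by each of the latencies quoted
above; the function and variable names are hypothetical, and the
calculation ignores any part of the latency that is not due to
travel of the wave.

```python
def wave_speed_m_per_s(distance_mm, delay_msec):
    """Average speed over a distance covered in a given time.

    Millimeters per millisecond are numerically equal to meters
    per second, so no unit conversion is required.
    """
    return distance_mm / delay_msec

MEMBRANE_LENGTH_MM = 30.0            # length taken in the text
half_length = MEMBRANE_LENGTH_MM / 2.0

# Upper (apical) half: latency difference of 1.5 msec.
apical_speed = wave_speed_m_per_s(half_length, 1.5)

# Basal half: earliest impulses appear after as little as 0.7 msec.
basal_speed = wave_speed_m_per_s(half_length, 0.7)

print(f"apical half: {apical_speed:.1f} m/s")
print(f"basal half:  {basal_speed:.1f} m/s")
```

The first quotient is 10 meters per second and the second a
little over 21. Since the 0.7-msec latency presumably includes
delays other than the travel of the wave itself, the second
figure is best read as a lower bound, in keeping with the phrase
'at least 20 to 30 meters per second.'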

Another type of evidence for a slow traveling wave in the
cochlea is presented by Bekesy (15). When a click and a
steady tone are presented simultaneously to the ear, the click is
partially masked by the tone. A combination of click and tone
can be presented to one ear of a human subject, while the other
ear is stimulated simultaneously by a click alone. The
intensities of the two clicks may then be adjusted so that they appear
equal in loudness. Under these conditions the observer hears
a click which seems to come from a source situated to the side
of the head, the side toward the ear stimulated by click plus
tone. The effect is more pronounced the lower the frequency
of the masking tone. Apparently the low tone masks the late
component of the excitation due to the click, so that the click
appears to occur earlier in time. Bekesy measured the interval
by which the unmasked click must be advanced, in time, in
order that the sound image should be referred to the center of
the head (see Chapter 6), or in order that it should be
physiologically simultaneous with the masked click. When a
masking tone of 800 cycles was used, the difference was scarcely
measurable, but with a 100-cycle tone it was 13 msec. He
concluded that the apparent advance in time due to masking
depended upon the elimination, by the low-pitched masking
tone, of the later components of the nerve response, which arise
when the traveling wave reaches the apical end of the basilar
membrane.

A velocity of 10 or 20 meters per second is far less than the
velocity of sound waves in water, or even in air. This relatively
slow velocity is not a pertinent objection to the hypothesis of a
traveling wave, however, since the problem is not one of
propagation of sound waves in an infinite extent of homogeneous
fluid, but of the propagation of a wave along a tube having
an elastic wall. The velocity of propagation of a wave of
pressure in such a system is much less than the velocity of sound
and depends upon the elasticity, thickness, and other properties
of the wall of the tube (cf. Appendix II). An interesting and
analogous case of propagation of a disturbance in an elastic tube
is the transmission of the pulse wave in an artery. With
hardening of the arteries, the velocity of the pulse wave is increased
(C. J. Wiggers).
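
One standard idealization of wave travel in an elastic tube is
the Moens-Korteweg relation, c = sqrt(E·h / (ρ·D)), for a
thin-walled tube of wall thickness h, inner diameter D, and wall
elasticity (Young's modulus) E, filled with an incompressible
fluid of density ρ. It is offered here only as an illustration;
it is not the set of equations given in Appendix II, and the
numerical values below are rough assumptions chosen for an
artery-like tube, not measurements.

```python
import math

def pulse_wave_speed(youngs_modulus_pa, wall_thickness_m,
                     inner_diameter_m, fluid_density_kg_m3):
    """Moens-Korteweg speed (m/s) of a pressure wave in a thin-walled
    elastic tube filled with incompressible fluid."""
    return math.sqrt(
        youngs_modulus_pa * wall_thickness_m
        / (fluid_density_kg_m3 * inner_diameter_m)
    )

# Illustrative, assumed values: 1-mm wall, 20-mm bore, blood-like fluid.
soft_wall = pulse_wave_speed(0.5e6, 1.0e-3, 20.0e-3, 1050.0)
hard_wall = pulse_wave_speed(2.0e6, 1.0e-3, 20.0e-3, 1050.0)

print(f"compliant wall: {soft_wall:.1f} m/s")
print(f"stiffened wall: {hard_wall:.1f} m/s")
```

Under these assumptions both speeds are only a few meters per
second, far below the velocity of sound in air or water, and a
fourfold stiffening of the wall doubles the speed, in line with
the observation on hardened arteries.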

ADDITIONAL FEATURES OF THE TRAVELING 
WAVE 

Let us now consider the probable relation between the form
of a click, its acoustic spectrum, the region of the basilar
membrane maximally stimulated by it, and the manner in which
the stimulation travels along the membrane from the round
window to the helicotrema. Various aspects of this total
problem have been treated in other chapters, but it should be
instructive at this point to review the picture. In so doing we shall see
how it is possible for a stimulus comprising a continuous
spectrum of sound-energy to send a wave of disturbance running
along the basilar membrane in such a way as to set up a
distribution of excitation which reflects the distribution of energy in
the sound spectrum.

These notions are illustrated by the schematic diagrams in
Fig. 117. Click A is a dull click whose pressure wave rises and
falls rather gradually. To the listener click A sounds like a
dull thud. The distribution of energy among the frequencies
composing the spectrum of this click shows a maximum near
the low-frequency end of the scale. In contrast, click B is a
sharp click whose pressure-change is abrupt. Click B sounds
like a sharp crack. The spectral distribution for this type of
click is peaked in the region of high frequencies. Now, each
of these clicks initiates a movement of the eardrum, and hence
of the stapes, which has essentially the form of the pressure
wave of the click. Movement of the stapes tends to compress
the fluid in the scala vestibuli, but the pressure is relieved by a
bulging of the basilar membrane into the scala tympani, while
the pressure in the scala tympani is relieved by a bulging of
the round window membrane. The wave of pressure tending
to bend the basilar membrane begins near the round window
and travels toward the helicotrema at a speed of 10 to 30 meters
per second. The membrane executes a whip-like motion, and
the important question becomes that of determining how the
motion differs when we ‘crack the whip’ with a dull as against
a sharp click. Because of the factors of mass, resistance, and
elasticity in the cochlea, the sharp click will move, with relative
ease, the part of the membrane near the stapes, but its effect
will rapidly diminish as the wave moves farther away. The
traveling wave of the dull click, on the other hand, will be
relatively more effective near the helicotrema in producing a
displacement of the membrane. Hence, it is possible to picture
the relative amplitude of motion of the basilar membrane at
various points along its length, as shown in Fig. 117.

Fig. 117. The three pairs of curves represent schematically the wave form,
the corresponding sound spectra, and the amplitude of displacement along
the basilar membrane of a dull click (A) and of a sharp click (B).

These curves, showing the relative amplitude of the
traveling wave at different distances from the round window, reflect
the essential features of the curves representing the spectrum
of the clicks. When the maximum of the spectrum is in the
region of low frequencies, the maximum of the disturbance on
the basilar membrane is near the helicotrema. A spectrum
dominated by high frequencies produces a maximum of
displacement near the round window. This correspondence of
spectrum to pattern of stimulation shows that, in a sense, the ear
behaves essentially as an analyzer, even for discrete clicks,
and it explains why some clicks sound sharp and high-pitched
and others sound dull and low-pitched (cf. Chapter 3).
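
The relation between the abruptness of a click and the
distribution of energy in its spectrum can be checked
numerically. The sketch below is a simplification under stated
assumptions: each click is modeled as a Gaussian pressure pulse
(a hypothetical wave form, not the curves of Fig. 117), and the
spectra are compared by their energy-weighted mean frequency
rather than by the position of a peak. All names and widths are
illustrative.

```python
import cmath
import math

SAMPLE_RATE_HZ = 20000

def gaussian_click(width_msec, duration_msec=20.0):
    """Sampled Gaussian pressure pulse; a smaller width is a sharper click."""
    n = int(SAMPLE_RATE_HZ * duration_msec / 1000.0)
    center = duration_msec / 2.0
    samples = []
    for i in range(n):
        t_msec = 1000.0 * i / SAMPLE_RATE_HZ
        samples.append(math.exp(-0.5 * ((t_msec - center) / width_msec) ** 2))
    return samples

def mean_spectral_frequency_hz(samples):
    """Energy-weighted mean frequency from a direct discrete Fourier sum."""
    n = len(samples)
    total_energy = 0.0
    weighted_sum = 0.0
    for k in range(1, n // 2):  # positive frequencies, zero frequency omitted
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(samples))
        energy = abs(coeff) ** 2
        total_energy += energy
        weighted_sum += (k * SAMPLE_RATE_HZ / n) * energy
    return weighted_sum / total_energy

dull = mean_spectral_frequency_hz(gaussian_click(width_msec=1.0))
sharp = mean_spectral_frequency_hz(gaussian_click(width_msec=0.1))
print(f"dull click:  mean frequency {dull:.0f} Hz")
print(f"sharp click: mean frequency {sharp:.0f} Hz")
```

The gradual pulse concentrates its energy at low frequencies and
the abrupt pulse spreads it toward high frequencies, which is
the qualitative contrast drawn between clicks A and B.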

TRAVELING WAVES IN RESPONSE TO 
STEADY TONES 

When we apply the concept of the traveling wave to the
motion of the basilar membrane in response to a steady tone,
we find that the various parts of the basilar membrane do not
move up and down in the same phase but that the part near
the stapes leads, in phase, the part closer to the helicotrema.
Fletcher (2) applied these principles in a theoretical study of
the mechanics of the cochlea and deduced a similar type of
motion for the basilar membrane. He was forced to make various
assumptions concerning the tension and other constants of the
basilar membrane, but he pointed out that, according to almost
any reasonable assumption one can make, the part of the
membrane next to the oval window leads in phase of vibration the
motion of the part near the helicotrema. In accordance with
this view, if motion pictures were to be taken with exposures
T/8 seconds apart, where T is the period for one complete cycle,
we should obtain a series of pictures like those shown in Fig. 118.
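
A series of such pictures can be imitated with a simple formula,
y(x, t) = A(x)·cos(2πt/T − φ(x)), in which the amplitude A(x)
has a single maximum and the phase lag φ(x) grows with distance
from the oval window. The sketch below is not Fletcher's
calculation: the Gaussian envelope, its width, and the linear
phase lag are assumptions chosen for illustration, and only the
16-mm place of maximal amplitude is taken from the caption of
Fig. 118.

```python
import math

PEAK_MM = 16.0     # place of maximal amplitude quoted for Fig. 118
LENGTH_MM = 30.0   # assumed length of the membrane

def envelope(x_mm, spread_mm=4.0):
    """Assumed amplitude of vibration at distance x from the oval window."""
    return math.exp(-0.5 * ((x_mm - PEAK_MM) / spread_mm) ** 2)

def phase_lag(x_mm, lag_per_mm=0.3):
    """Assumed phase lag in radians; it grows toward the helicotrema."""
    return lag_per_mm * x_mm

def displacement(x_mm, cycle_fraction):
    """Instantaneous displacement at x after a given fraction of the period T."""
    return envelope(x_mm) * math.cos(2.0 * math.pi * cycle_fraction - phase_lag(x_mm))

positions = [0.5 * i for i in range(int(LENGTH_MM / 0.5) + 1)]  # 0 to 30 mm

# Nine exposures taken T/8 apart span one complete cycle.
snapshots = [[displacement(x, k / 8.0) for x in positions] for k in range(9)]

for k, shape in enumerate(snapshots):
    crest = positions[shape.index(max(shape))]
    print(f"t = {k}T/8: crest at {crest:4.1f} mm")
```

In the printed positions the crest moves away from the oval
window from one exposure to the next, while the amplitude at any
place never exceeds the fixed envelope, which has its maximum at
16 mm. The first and ninth exposures coincide, since they are a
full period apart.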

This picture in no way conflicts with the idea of a region of
maximal amplitude of vibration for a given frequency. The
figure actually illustrates maximal amplitude at the distance of
16 mm from the oval window, the situation which would be
produced by a tone of about 2000 cycles.

Fig. 118. Diagrams of the pattern of vibration of the basilar membrane
during one complete cycle of a steady tone. The nine heavy lines represent
the instantaneous patterns of the membrane at successive intervals of one
eighth of a cycle of the tone. Each wave of vibration passes progressively along
the membrane away from the oval window. The amplitude of vibration is
greatest at 16 mm from the oval window. (Fletcher, 2.)

Bekesy (2) has pointed out that a motion of the basilar
membrane like that shown in Fig. 116 would tend to produce
small vortices, or eddies, in the liquid above the position of
maximal stimulation. When the exciting tones were loud, he
observed such eddies, both in his models of the ear and in
dissected human ears. He believes that the pressure of these
eddies against the membrane causes stimulation of the nerve
endings. He points out, furthermore, that such an eddy would
produce a movement of the liquid in the semicircular canals
and affect the sense of balance in such a way as to cause the
listener to tilt his head toward the ear receiving the sound.
Precisely this effect appears to occur in response to very loud
tones. The presence of these eddies at high sound intensities
need not, however, account for the stimulation of the nerve
endings of the auditory fibers. At low intensities, such eddies
probably do not occur and therefore stimulation must depend
upon mechanical distortion of the hair cells, due to bending of
the basilar membrane, rather than upon a hydraulic pressure
generated by vortices (see p. 343). (The possibility of
stimulation by a pressure gradient in the fluid of the cochlea has
been pointed out by Reboul (see Appendix II), but this effect
is different from the pressure generated by vortices.)

THE DAMPING OF THE BASILAR MEMBRANE 

An important aspect of the basilar membrane as a vibrating
structure is its damping factor. The damping factor tells us
how rapidly the vibrations die out after the stimulus ceases.
The impossibility, however, of considering the basilar
membrane as a separate structure apart from the fluid which
surrounds it should be at once apparent. In fact, any attempt to
discover the damping factor of the basilar membrane itself is
subject to difficulties arising from the fact that the membrane
is closely coupled to the rest of the auditory mechanism. Hence,
the rate of decay in the oscillations of the entire auditory
system when a tone is turned off may or may not correspond to
the rate of decay attributable to the inner ear alone.
Furthermore, measurements of the damping of the ear in terms of the
persistence of an auditory sensation may be vitiated by the factor
of persistence in the central nervous system (see p. 223).

Considering the auditory mechanism as a whole, several
efforts to determine its time-constant, or damping factor, have
yielded values ranging from 33 to 200 msec. A representative
mean value for this time-constant is 50 msec (Lichte). This
value means that the amplitude of the oscillations of the ear as
a whole following the cessation of a tone falls to 1/2.718 of the
initial value in the time of 50 msec (cf. p. 263).
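
The meaning of the time-constant can be illustrated by assuming
a simple exponential decay, amplitude = exp(−t/τ), with τ equal
to the representative value of 50 msec. The function names
below are hypothetical, and the exponential form is the
assumption implied by the definition just given.

```python
import math

def relative_amplitude(elapsed_msec, time_constant_msec=50.0):
    """Fraction of the initial amplitude remaining after a given time,
    assuming simple exponential decay."""
    return math.exp(-elapsed_msec / time_constant_msec)

def decay_in_db(elapsed_msec, time_constant_msec=50.0):
    """The same decay expressed as a drop in decibels."""
    return -20.0 * math.log10(relative_amplitude(elapsed_msec, time_constant_msec))

for t in (0.0, 50.0, 100.0, 200.0):
    print(f"{t:5.0f} msec: amplitude {relative_amplitude(t):.3f}, "
          f"drop {decay_in_db(t):4.1f} db")
```

After one time-constant the amplitude stands at 1/2.718, or
about 0.37, of its initial value, a drop of roughly 8.7 db; each
further 50 msec brings an equal drop in decibels.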

It is probable, however, that if we consider the inner ear
separately, the viscosity of the fluid in the cochlea renders the
basilar membrane even more highly damped. One effect of
this large damping is to dull the selective response of the
membrane in such a way that a single tone activates an extensive
area. We have already seen (Fig. 93, p. 216) that a single
tone may mask other tones throughout the audible range, and
we have interpreted the curves for masking as representing the
extent of the disturbance attributable to a single frequency.
We shall see later (p. 330) that there is no evidence in the
electrical activity of the cochlea of any significant persistence of
vibration of the basilar membrane following the interruption
of a tone. Everything considered, then, we must conclude
that the inner ear is highly damped and that this damping
impairs its resolving power in the analysis of sound waves.



CHAPTER 11 

DEAFNESS AND BONE CONDUCTION 

Impairment of hearing may result from three main factors,
operating singly or in various combinations. The first factor
is the failure of transmission of the physical sound waves to the
inner ear. This type of deafness is frequently designated as
transmission deafness, and also, since it usually depends upon
some abnormality of the drum or middle ear, as ‘middle ear
deafness.’ The second factor is damage to the sensory cells or
to the nerve fibers and nerve centers immediately connected
with them. This damage produces what has been variously
designated as nerve deafness or ‘perception deafness.’ Nerve
deafness is not always due to degeneration of the auditory nerve,
but may result from degeneration of the hair-cells of the organ
of Corti. Damage to the hair-cells is, however, physiologically
equivalent to damage to the nerve. The third type of deafness
is the so-called central deafness, and includes conditions in
which impulses reach the central nervous system along the
auditory tracts, but in which the patient is unable to recognize them
or give them their usual meanings. This third type depends
upon abnormalities or dysfunction of the higher nervous
centers, and will not be considered further in this book.

The distinction between transmission-deafness and nerve
deafness is of fundamental practical importance, since, as long
as the essential sensory apparatus and its nerve supply remain
intact, there is always the possibility that means may be found
to deliver sound vibrations to that sense-organ at sufficient
intensity to stimulate it, and so give rise to useful hearing. Once
the sense-organ or nerve is destroyed, however, there is no hope
of regeneration or of attaining through any other nervous
pathway the differential sensitivity to sound waves which is the
necessary basis for the recognition of speech and music. The
effect of nerve deafness on the perceived loudness of sounds is
discussed in Chapter 4 (pp. 131-136).



TRANSMISSION DEAFNESS 

The simplest and most obvious form of transmission
deafness is mechanical closure of the external canal. This may
sometimes occur from the gradual accumulation of wax, and
lead to a hearing loss of many decibels. A similar loss may
be produced by the accumulation within the middle ear of solid
or semisolid material, such as pus or exudate from an
inflammatory process. The damping effect of such material on the
transmission of sound waves may be very large. The simple
damping effect of the material is, however, usually complicated
by the effects of the increased or decreased pressure within the
middle ear.

The effect of difference of pressure on the two sides of the
tympanic membrane is experienced in pure form when we are
subjected to rapid changes of atmospheric pressure, as in rapidly
moving elevators or airplanes. This effect, and the subjective
sensations which accompany it, are matters of common
experience, and a temporary deafness, particularly for low tones, quite
commonly occurs. Relief is afforded by swallowing, which
opens the Eustachian tube and allows equalization of pressure
on the two sides of the tympanic membrane.

During an infection of the middle ear, the Eustachian tube
is closed by the inflammatory process, and oxygen is gradually
absorbed from the air which has been trapped in the middle ear.
The tympanic membrane is then retracted, owing to the
diminished pressure in the middle ear. Fluid is later exuded during
the acute processes of inflammation and is finally reabsorbed.
The result of these processes is first to increase and then, if the
drum has not ruptured, to diminish pressure in the middle ear
and cause an acute retraction of the tympanic membrane. The
distention and retraction are both accompanied by a low-tone
deafness. The retraction may be relieved by ‘blowing out’ the
ears. In the presence of inflammation of the Eustachian tube
and middle ear, relief may require cannulation of the Eustachian
tube. Such treatment is often highly desirable as a
prophylactic measure, in order to avoid formation of permanent
adhesions which would hold the drum in its retracted position.





During the earlier stages of acute infection, when pus
accumulates in the middle ear, the tympanic membrane bulges
outward, and may rupture spontaneously unless relieved by
surgical intervention. Surgical perforation is desirable under
these conditions, since it allows the pus to escape through a small
opening in the drum, which almost invariably heals following
recovery from the infection. If the drum is not pierced,
long-continued internal pressure may shut off the blood supply from
portions of the tympanic membrane and cause widespread
necrosis, so that, after spontaneous rupture has finally occurred,
repair may not be complete and may leave a permanent opening
in the drum. The degree of permanent loss of hearing
following such a condition is variable. Perforation of the drum per se
causes very little loss of sensitivity (Lorente de No, 1, and see
p. 253). Middle ear deafness resulting from infection usually
depends upon thickening of the drum and the formation of
permanent adhesions to the ossicles. An extensive deficiency
of the tympanic membrane increases such ‘middle-ear’ deafness.
This common type of deafness, well recognized clinically, is
characterized by loss of hearing for low and sometimes for
middle frequencies, and usually, but not always, retention of
sensitivity for high tones. It corresponds rather closely to the
condition realized, physiologically, during maximal contraction of
the inner ear muscles, and is to be interpreted in the same way,
that is to say, as a restriction imposed upon the free vibration of
the transmission mechanism of the middle ear.

When, as a result of severe infection, the tympanic
membrane, and also the ossicles, are completely lost, hearing may still
be possible. The hearing loss in these cases involves high as
well as low tones (Fig. 104, p. 253) and is sometimes very
severe. It depends upon the exact details of the physical changes
which have occurred.

Fixation of the stapes in the oval window may occur in
certain pathological conditions, notably otosclerosis. Without
going into detail concerning this important, but obscure,
condition, which usually involves a progressive degeneration of
some of the hair-cells in the organ of Corti (Lurie, 1), we may
note that fixation of the stapes would be expected, on physical
principles, to cause a severe low-tone loss as well as some loss
throughout the entire range. This is precisely what usually
occurs.

We cannot be dogmatic about the degree and type of
hearing loss to be expected from any particular abnormality in the
middle ear. Slight differences in the location of an adhesion,
or in its rigidity, may profoundly modify its influence upon the
movements of the ossicles. Although it is true, as a general rule,
that adhesions and other abnormalities in the middle ear tend
to depress low tones more than high, it is nevertheless well
recognized that high tones may be involved in transmission-deafness,
and, on the other hand, considerable pathology may exist
without measurable loss of hearing. This latter point is illustrated
by the post-mortem studies of Polvogt on the ears of sixty-three
patients whose hearing had been tested and found to be normal
shortly before death. He found that bands of embryological
or fibrous tissue in the niche of the round window may have
no effect on the acuity of hearing. We must assume that such
bands do not necessarily interfere with the essential movements
of the stapes. Marked pathological changes of the tympanic
membrane may also exist without causing any noticeable
impairment of the hearing. The drum may even be extremely
retracted and adherent to the mucous membrane over the
promontory. On the other hand, nothing was found in any of
Polvogt’s cases which would interfere with free movement of
the head of the malleus and incus, or with the normal
functioning of the Eustachian tube. Also the annular ligament which
surrounds the footplate of the stapes was normal in all cases.

BONE CONDUCTION 

Even when air-conduction is completely abolished by some
pathological process, it is possible to hear vibrations which are
effectively transmitted to the bones of the skull. The stem of a
vibrating tuning fork applied to the mastoid process is a simple
method of obtaining such transmission. Tests based upon this
procedure are employed for the differentiation between
deafness due to loss of air-conduction and deafness due to damage
or degeneration of the sense-organ of the inner ear (American
Otological Society Symposium). Special bone-conduction
receivers have been designed for use with standard audiometers to
transform alternating electric currents into mechanical
vibration of the skull. They correspond in principle to the familiar
telephone receiver which generates sound waves. The
audiometer with its bone-conduction receiver is calibrated at various
frequencies in terms of the threshold of normal ears, and the
loss of sensitivity of a given ear, as a function of frequency, may
thus be measured quantitatively (see p. 64). As long as the
threshold for bone-conduction remains normal or nearly so,
we may safely infer that the sense-organ essential for hearing
remains normal. If bone-conduction is impaired, the inner
ear may be damaged, but not necessarily (see p. 295).

It is obvious that bone-conduction may play some small part
in the normal transmission of ordinary sound vibrations to the
inner ear, but the efficiency of transfer of energy from air to
bone is so low that it is usually negligible. Nevertheless, in
determinations of the sensitivity of a partially deafened ear, the
conduction of sound to the opposite normal ear by
bone-conduction may become very important. The threshold for such
transmission to the opposite ear is some 50 or 60 db below the
threshold for air-conduction. It is therefore impossible to test
thresholds more depressed than this, unless some means are
employed to eliminate the hearing of the normal ear. This
elimination may be achieved by masking it with some other
sound, usually a rough noise (American Otological Society
Symposium). Masking prevents the normal ear from detecting the
faint sound in question, and does not appreciably affect the
threshold of hearing for the opposite ear (see Chapter 8) unless
the masking noise is itself carried back, by bone-conduction, to
the ear which is being tested.
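
The practical rule implied here can be stated as a simple test.
The sketch below is a hypothetical decision rule, not a clinical
procedure: it assumes that the opposite ear is normal and takes
50 db, the lower of the two figures just quoted, as the level at
which a tone presented to the tested ear becomes audible to the
other ear by way of the skull. The names are illustrative.

```python
CROSSOVER_DB = 50.0  # assumed level at which the opposite ear hears the tone

def needs_masking(hearing_loss_db, crossover_db=CROSSOVER_DB):
    """Return True when the loss in the tested ear is large enough that
    the opposite, normal ear could respond in its place."""
    return hearing_loss_db >= crossover_db

for loss in (20.0, 45.0, 55.0, 70.0):
    verdict = "mask the opposite ear" if needs_masking(loss) else "no masking needed"
    print(f"loss of {loss:.0f} db: {verdict}")
```

Losses smaller than the crossover figure can be measured
directly; greater losses cannot be trusted unless the normal ear
is occupied by a masking noise.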

ABSOLUTE THRESHOLD FOR BONE CONDUCTION 

No measurements are available for the energy required to
attain the threshold for bone-conduction. Measurements of
the threshold in terms of amplitude of vibration show that the
movement necessary to give rise to hearing is extremely small.
Knudsen and Jones give 2.7 × 10~* cm as the threshold
amplitude for an aluminum rod applied to the mastoid process when
vibrating at 1024 cycles. Bekesy (12) measured the actual
movement of the forehead in the immediate neighborhood of
such a vibrating rod and found its amplitude to be only a few
per cent of the movement of the rod. Correcting for this factor,
he gives a figure of about 3.5 × 10⁻⁹ cm movement of the skull
for threshold at 800 cycles. The threshold is lowered to 5 × 10⁻¹⁰
cm by closing the external canal. This figure is almost
identical with the minimal threshold amplitude of movement of the
eardrum itself determined by a vibrating rod in direct contact
with the drum (Wilska, see Fig. 18, p. 56). The relation to
frequency of the threshold amplitude for bone-conduction has
been determined directly only for frequencies below 1024, where
the relation seems to be similar to that for air-conduction.

MECHANISM OF BONE CONDUCTION

Experimental evidence (Bekesy, 12) appears to indicate that
both air-conduction and bone-conduction ultimately cause
similar movements of the endolymphatic fluid and of the basilar
membrane in the cochlea. The air-borne vibrations arrive by
way of the ossicles, whereas vibrations of the skull result in
compression of the canals of the inner ear, including the labyrinth.
Such compression increases the pressure in the scala vestibuli
more than in the scala tympani because the round window, in
communication with the scala tympani, is more elastic than the
oval window, which is closed by the footplate of the stapes (Fig.
119 B). Still more important, according to Bekesy, is the fact
that the semicircular canals communicate with the scala
vestibuli and, when they are compressed, fluid is forced into the
scala vestibuli (Fig. 119 C). By a series of ingenious
experiments in which bone-conducted vibrations were compensated
by equal and opposite vibrations delivered to the stapes from
air-borne waves and in which the freedom of movement of the
stapes was altered by raising and lowering the air pressure on
the tympanic membrane, Bekesy demonstrated the existence of
this type of compression. It constitutes the most important
element of the mechanism of hearing by bone-conduction when
the external canal is open.

Fig. 119. Showing how compression of the inner ear by bone-conducted
sound waves leads to movement of the basilar membrane. The dotted lines
indicate the sizes of the various chambers and the positions of the various
membranes during compression by a sound wave in the skull. (Bekesy, 12.)
A. Hypothetical case of symmetrical compression of the cochlea and equal
yielding of the membranes of the oval and round windows. No movement
of the basilar membrane would occur.
B. The round window actually yields more than the oval window to equal
pressures. The basilar membrane is moved slightly toward the scala tympani.
C. The semicircular canals are compressed as well as the cochlea. Fluid
forced from the semicircular canals into the scala vestibuli causes greater
movement of the basilar membrane into the scala tympani.

When an observer listens 
to a tone by bone-conduction 
and closes the external audi 
tory canal, the tone increases 
in loudness The vibration of 
the skull then compresses the 
air in the external canal and 
the observer virtually hears 
by the usual mechanism for 
air conduction Th c loudness 
may then be diminished by 
increasing the air pressure in 
the external canal, thereby 
tensing the tympanic mem- 
brane and the ligament of the 
footplate of the stapes, just 
as in ordinary air-conduction 
On the other hand, when the 
external canal is open, tensing 
of the tympanic membrane 
and ossicular chain causes an 
increase in loudness of a bone 
conducted tone because it 
fixes 'inc Stapes more ngitfry 
and prevents loss of pressure 
from the scala vestibuh by 
way of the oval window 
Compression of the cavities of 
the inner ear and labyrinth 



MECHANISM OF BONE CONDUCTION 


295 


is here the dominant factor in producing vibration of the basilar 
membrane 

The conclusion that an osseous rather than an osseo-tympanic
pathway is most important for the transmission of bone-conducted
vibrations to the inner ear is confirmed by the animal
experiments of Guild (3) and of Wever and Bray (7). The
electrical activity of the cochlea in response to air-conducted and
to bone-conducted vibrations was measured and then the ossicular
chain was interrupted at the incudostapedial joint. Air-conduction
was diminished by 50 to 60 db, but bone-conduction
was not reduced by more than 5 or 10 db.

In human ears a particular osseous pathway is of special
importance. This pathway is the bony trabeculae of the subaditus
region. These bony structures are almost directly opposite the
opening of the scala vestibuli into the vestibule. The importance
of these trabeculae is shown by the demonstration in
post-mortem sections (Guild, 3) of fractures of these trabeculae
in a number of individuals whose hearing by bone-conduction
had been found to be markedly impaired. These fractures
explain the otherwise paradoxical situation of a person whose
hearing by air-conduction is normal, but who has impaired
bone-conduction. All the cases in Guild's series which showed
this combination of normal air-conduction and impaired
bone-conduction proved to have fractures of all the trabeculae in
the subaditus region.



CHAPTER 12 

PRINCIPLES OF NEUROPHYSIOLOGY 

From a consideration of the ear as a mechanical system we turn
now to the study of its behavior as a neuromechanical transducer
capable of transforming acoustic energy into nerve impulses. In
order properly to understand this function of the ear, it is
advantageous for us first to review certain fundamental principles
of neurophysiology. The activity of nerves reveals itself most
readily by the electrical phenomena which accompany it, and
much of the recent development of the physiology of audition
has been due to the application of the methods of electrophysiology
to the organ of hearing and to its associated nervous
pathways. In this chapter we shall examine the nature of the
electric potentials generated in living cells.

ELECTRICAL POLARIZATION OF CELLS 
Certain electrical properties are exhibited by all living cells.
Other properties or activities are limited to special types of cells
whose functions are highly developed and highly specific. All
cells and tissue fluids contain electrolytes in solution and are
capable of conducting electricity. We measure this property in
terms of electrical resistance. Certain limitations are imposed
on the free movement of ions, partly by the internal structure of
protoplasm, partly by the boundary membranes of the cells, and
partly by the organization of cells into definite structures like
the sheaths of nerve trunks or the dura mater of the brain, whose
resistance is great as compared with tissue fluids or protoplasm.

As far as we know, all cells are electrically polarized, as
between the inside of the cell and the external medium. This
statement means that if one electrode is placed on the outer
surface of a cell and another brought effectively in contact with
the protoplasm, a difference of electric potential appears between
the two electrodes. This difference of potential can be reduced
by injury and abolished by death. It is in part a direct
physicochemical expression of differences of composition between
the interior of a cell and its surroundings, and in part an expression
of the dynamic activity involved in maintaining these differences.
We are ignorant as to the precise mechanisms involved
in the generation of these bioelectric potentials, but many of the
important facts are well described by the membrane hypothesis,
which regards the cell as a solution, different in composition
from the surrounding medium, and separated from it by a
semipermeable membrane. The difference of potential which exists
across this membrane was first described by Helmholtz as a
double layer of ions: the positive charges outside and the negative
charges inside the membrane. This picture has, for decades,
been the starting point of electrophysiological theory.

Fig. 120. Diagram of the electrical polarization of a cell membrane. One
electrode is applied to an injured end of the cell and the other to an uninjured
region. The polarization is measured by the potentiometer, P.

When a cell is injured, the surrounding membrane may be
partially interrupted or destroyed, or it may be rendered more
completely permeable. The positive and negative charges,
formerly separated by the membrane, can then unite, and the
surface is depolarized. An electrode placed in contact with this
region is effectively in contact with the interior of the cell (Fig.
120), and the difference of potential recorded between such
an electrode and one placed on an uninjured portion of the
surface of the same cell measures the degree of polarization of
that cell. Such a difference of potential is often spoken of as
an injury potential, since injury is required to penetrate the
membrane at some point in order to make the measurement.




It should be clearly recognized, however, that the potential is
not generated by the injury, but by the part of the cell which
remains uninjured. The injury is simply an unfortunate necessity
in gaining access to the interior of the cell. As long as the
cell is completely surrounded by its semipermeable membrane,
we cannot measure the potential between inside and out. All
that we can ever observe by electrical measurements is a difference
of potential between two points.

Changes in this physiological polarization are probably associated
with many forms of activity, including neural conduction
and muscular contraction, but it has been difficult to analyze and
measure the changes of polarization on account of the complications
introduced by injury. Even when we burn or crush a
piece of tissue, the cells may at once begin to heal, to rebuild
their semipermeable membranes, and to check the diffusion of
their contents into the surrounding fluids. To the extent that
they succeed, they defeat our experimental purposes. Even the
insertion of microelectrodes into the interior of cells does not
always overcome this difficulty, for, unless the electrode enters
a fluid vacuole in the protoplasm, this same process of repair will
take place. If the electrode does enter a vacuole, we face the
probable polarization of the boundary surface between the
vacuole and the surrounding protoplasm. Difficulties and
uncertainties of this sort have kept changes of polarization a topic
of interesting controversy for many years.

DISTORTION POTENTIALS 
Two fundamentally different types of transient electrical
changes may be produced by cells as the result of external
stimulation. In the first of these, the cell plays a passive role and
simply transforms mechanical, or other incident energy, into
electrical effects which we designate as distortion potentials.
The magnitude of these potentials is a function of the amount
of distortion produced. An example is the wave of electric potential
which accompanies the mechanical wave set up in the
long plant cell, Nitella, by an abrupt mechanical stimulation
(Osterhout and Hill).





There are three general types of physical processes which
might account for distortion-potentials as we observe them. (1)
The mechanical movement or distortion of any charged or
polarized structure may generate an electric potential according
to the same principles that underlie the operation of a condenser
microphone: a change in the distance between two charged
plates or membranes produces a change of potential. (2) The
mechanical changes may alter electrical resistance, and so,
indirectly, current-flow and potential. The ‘stretch-effect’ in
muscle, described by Einthoven, is probably to be explained by
this type of process. The principle here corresponds to that
of the ordinary carbon microphone or telephone transmitter.
Finally, (3) it is quite possible that mechanical distortion of a
cell, with the generation of mechanical stresses and strains in a
structure containing oriented molecules, may generate differences
of potential in a manner analogous to the familiar piezoelectric
effect in certain inorganic crystals such as quartz. This is the
principle underlying the crystal microphone: stress on a crystalline
structure sets up a difference of potential along certain
axes. The piezoelectric effect in a quartz plate is illustrated in
Fig. 121, but it is worth noting that the term ‘piezoelectric’ need
not be confined to such crystalline structures, since its definition
is such as to include all electric potentials generated by the
application of mechanical pressure. Effects of this type are of
particular concern in our study of the ear, for it appears that
the hair-cells of the organ of Corti produce a potential (Chapter
13) in accordance with the principle of the piezoelectric effect.
It is also important to note that in the piezoelectric effect, as in
distortion potentials in general, the potential generated is
essentially proportional to the mechanical distorting force.

Fig. 121. An example of the piezoelectric effect. The shaded area
represents the cross-section of a plate cut from a hexagonal quartz
crystal, whose original cross-sectional outline is indicated by the
broken lines. When mechanical force is applied, as shown by the
arrows in A, to the faces of the plate, an electric potential-difference
appears between the edges. If the polarity of the force is reversed,
as in B, that is to say, if the force is a ‘traction’ instead of a
pressure, the polarity of the potential-difference is reversed.
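
The condenser-microphone principle named under (1) can be put in
numbers with the textbook formula for an ideal parallel-plate
condenser holding a fixed charge, V = Q d / (ε0 A). The sketch
below is an idealization supplied for illustration only; the plate
geometry, the fixed-charge assumption, and the function name are
not taken from the text, and no claim is made that a cell membrane
behaves as an ideal condenser.

```python
EPSILON_0 = 8.854e-12  # permittivity of free space, farads per meter


def plate_potential(charge_c, separation_m, area_m2):
    """Potential difference (volts) across an ideal parallel-plate
    condenser in vacuum that holds a fixed charge.

    Illustrative idealization: V = Q * d / (epsilon_0 * A).
    """
    return charge_c * separation_m / (EPSILON_0 * area_m2)


if __name__ == "__main__":
    charge = 1e-9   # coulombs (arbitrary illustrative value)
    area = 1e-4     # square meters (arbitrary illustrative value)
    for separation in (1.0e-3, 1.1e-3, 1.2e-3):
        volts = plate_potential(charge, separation, area)
        print(f"separation {separation * 1e3:.1f} mm -> {volts:.1f} V")
```

With the charge held constant, the potential varies in direct
proportion to the separation of the plates, which is the sense in
which a distortion potential follows the amount of distortion.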

ACTION POTENTIALS AND THE ALL OR NONE LAW 

An entirely different type of electrical effect in response to
stimulation is the action potential, the energy for which is
contributed by the cell itself and not by the stimulus. In direct
contrast to distortion potentials, the action potential is
all-or-none in character. The stimulus serves merely as a trigger to
start the reaction. The potential generated is not a function of
the strength of the stimulus but depends entirely upon the
nature and the immediate condition of the cell which generates
it. The phenomenon is transient in character and forms part
of a physicochemical disturbance which, when initiated at a
localized point, spreads over the whole of the cell. This propagated
disturbance is actually the nerve impulse. It may be
conceived, according to the generally accepted membrane
hypothesis, as a wave of increased permeability and depolarization
which sweeps over the entire polarized membrane. The external
stimulus serves only to start this process at one point, and
thereafter the wave is self-propagating. The active region
presumably excites the neighboring inactive region by means of the
local bioelectric current which flows externally from the
(polarized) inactive region to the (depolarized) active region exactly
as the injury current flows from an uninjured to an injured
area (Fig. 122). After from one to several milliseconds the
normal semipermeable state is restored by a spontaneous process
of recovery and repair.
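
The contrast between the two kinds of response can be caricatured
in a few lines. This is a deliberately crude sketch: the threshold,
the spike size, the gain, and the function names are arbitrary
illustrative assumptions, and refractoriness is ignored entirely.

```python
def distortion_potential(force, gain=1.0):
    """Passive response: output proportional to the distorting force."""
    return gain * force


def action_potential(stimulus, threshold=1.0, spike_size=100.0):
    """All-or-none response: the stimulus acts only as a trigger.

    Below threshold nothing happens; at or above threshold the size of
    the response is fixed by the cell (spike_size), not by the stimulus.
    """
    return spike_size if stimulus >= threshold else 0.0


if __name__ == "__main__":
    for s in (0.5, 1.0, 2.0, 10.0):
        print(s, distortion_potential(s), action_potential(s))
```

Doubling the stimulus doubles the distortion potential, whereas any
stimulus at or above threshold yields the same action potential.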

Since the energy for this action-current, or action potential,
comes from the tissue, and the external stimulus serves only as a
trigger to set it off by upsetting the equilibrium in a metastable
system, it is not surprising that the nerve impulse and its action
potential are all-or-none in character. The magnitude of the
potential, like the amount of chemical action and heat production
associated with the impulse, is not a function of the
strength of the stimulus. The stimulus either excites a cell



Fig. 122. Diagram illustrating the membrane-theory of conduction of the
nerve impulse. The polarized semipermeable membrane (solid outline) becomes
depolarized and permeable in the active region. The current flow in
the local bioelectric circuits, represented by the curved arrows, depolarizes the
region in advance of the active zone and thus extends the region of activity.
In the region of recovery the nerve is temporarily refractory. A potentiometer,
P, connected to active and inactive regions registers the relative electrical
negativity of the active region. (Regions labeled in the figure: excitable
fiber, relative refractory period, absolute refractory period, excitable fiber.)

excited again, but a stronger stimulus is required to reach threshold
and evoke a response; and the action-potential is smaller
than it is when the cell is fully recovered. We may compare
the recovery process to the recharging of a storage battery or an
electric condenser.

When stimulation is repeated at a temporal interval so brief
that each impulse is set up in the relative refractory period of
its predecessor, the successive action-potentials become smaller
and smaller, until a steady state is reached at which the processes
of recovery and restitution just balance the dissipation of
energy in successive impulses. The higher the frequency of
stimulation, the lower is this equilibrium level to which the
size of the action potential sinks. This process of equilibration,
the attainment of a dynamic equilibrium between restoration
and dissipation, is a type of fatigue. It is well illustrated in
the auditory nerve (Chapter 16), and is characteristic, in general,
of the active, all-or-none type of response. It should be clear
from the above description that the magnitude of the electric
response of nerve or muscle is not invariable, although the
all-or-none law is often misinterpreted in this sense. Refractory
period, equilibration, and fatigue, all of which are characteristic
of the all-or-none type of response, are not found in the passive
types of reaction described as distortion potentials.
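
The balance between dissipation and recovery described above can
be imitated with a toy model, in the spirit of the storage-battery
comparison. Everything in the sketch is an assumption made for
illustration: each impulse is taken to spend a fixed fraction of a
reserve, the reserve is taken to recover exponentially toward its
full value between impulses, and the size of an impulse is taken
to be proportional to the reserve at the moment of firing. The
parameter values and function names are hypothetical.

```python
import math


def next_reserve(reserve, interval_s, spend_fraction, tau_s):
    """Reserve just before the next impulse, given the reserve just
    before the present one (toy model; see the stated assumptions)."""
    after_impulse = reserve * (1.0 - spend_fraction)   # dissipation
    decay = math.exp(-interval_s / tau_s)
    return 1.0 - (1.0 - after_impulse) * decay          # recovery toward 1


def equilibrium_reserve(interval_s, spend_fraction, tau_s):
    """Steady level at which recovery just balances dissipation
    (the fixed point of next_reserve)."""
    decay = math.exp(-interval_s / tau_s)
    return (1.0 - decay) / (1.0 - (1.0 - spend_fraction) * decay)


def simulate(frequency_hz, spend_fraction=0.3, tau_s=0.005, n_impulses=200):
    """Relative sizes of successive impulses at a steady stimulation rate."""
    interval_s = 1.0 / frequency_hz
    reserve = 1.0
    sizes = []
    for _ in range(n_impulses):
        sizes.append(reserve)
        reserve = next_reserve(reserve, interval_s, spend_fraction, tau_s)
    return sizes


if __name__ == "__main__":
    for hz in (200, 500, 800):
        print(hz, round(simulate(hz)[-1], 3))
```

In this model the successive impulses shrink toward a steady level,
and that level is lower the higher the frequency of stimulation,
which is the qualitative behavior the text calls equilibration.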

An interesting situation in regard to the all-or-none law is
presented by the chemical activation of smooth muscle (Cannon
and Rosenblueth). Nerve impulses which are all-or-none in
character liberate, at the nerve-endings, chemical substances
such as acetylcholine or sympathin. These chemical mediators
initiate mechanical contraction and electrical disturbances in the
muscle fibers which involve the liberation of energy by the
muscle itself. The reaction of the muscle fibers is not
all-or-none, however, except as liberation of the mediator is dependent
upon an all-or-none nerve impulse. Furthermore, in smooth
muscle there is apparently no propagated disturbance equivalent
to the electrochemical type of nerve impulse. Instead, conduction
here occurs by simple diffusion of the chemical mediator
from the region where it is liberated. It appears probable that
similar chemical transmission also occurs in the synapses of the
central nervous system (see also p. 391).

The gray matter of the brain exhibits complex electrical
activity (see Chapter 18). It is not yet established to what
extent this electrical activity represents all-or-none reactions
such as nerve impulses and to what extent it is based upon
processes more nearly analogous to those in smooth muscle. It
is also possible that the cortical potentials are generated in part
by the chemical processes of recovery following activity. Such
after-potentials, which can also be detected in the nerve fiber
following the passage of a nerve-impulse, are not strictly
all-or-none in character.

THREE PRINCIPLES AIDING THE STUDY OF 
CELLULAR POTENTIALS 

In making applications of classical electrophysiology to structures
such as the inner ear, one point must be emphasized.
Classical electrophysiology has studied certain cells or parts of
cells, particularly skeletal muscle and the axons of neurons.
These cells constitute a special case: that of long cells which
can be injured at one point and still remain relatively normal
and active at another point. These are cells in which activity
may spread as a wave from point to point, and the periods of
activity at different regions may be separated in point of time.
The length of the cell makes possible large differences in activity
from end to end. On the other hand, cells which are small
and cells which are symmetrical are far more difficult to study
electrically.

Smooth muscle, composed of cells which are small relative
to the organ which they constitute, and relative to any ordinary
electrodes, shows electrical changes associated with activity, but
the interpretation of these changes is by no means as simple as
in the skeletal muscle. In fact, in smooth muscle it seems
extraordinary that we can detect systematic electrical changes. Such
changes imply, in the first place, that an electrical asymmetry
appears in the individual cells. If all points of the surface of a
cell underwent the same change at the same time, there would
be no loss of symmetry and no change in the external electrical
field in the neighborhood of the cell. There must be, in this
case, an asymmetry of the electrical changes in the individual
cell. Furthermore, since the individual units are myriad in
number, there must be some systematic orientation of these
units, as in the electric organ of Torpedo, so that the electric
field which is produced by one cell is not canceled by that of
another. In other words, even granting the production of an
electrical disturbance by each individual cell, we must infer a
more or less systematic orientation of these cells if the disturbances
are not to be statistically evened out to a dead level.
These two principles of asymmetry and of orientation of the
electrically active units in tissue are provided anatomically in
most tissues hitherto studied. They are apparently present in
others, such as the gray matter of the nervous system, smooth
muscle, gland cells, and, as we shall see, the organ of Corti.
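
The statistical point about orientation can be checked numerically.
The sketch below treats each cell as a unit source pointing in some
direction in a plane and adds the contributions as vectors; this
two-dimensional picture, the unit strengths, and the function names
are simplifying assumptions introduced only for illustration.

```python
import math
import random


def net_field(angles):
    """Magnitude of the vector sum of unit sources with the given
    orientations (radians)."""
    x = sum(math.cos(a) for a in angles)
    y = sum(math.sin(a) for a in angles)
    return math.hypot(x, y)


def aligned_units(n):
    """n units all oriented the same way."""
    return [0.0] * n


def random_units(n, seed=0):
    """n units oriented at random (seeded for repeatability)."""
    rng = random.Random(seed)
    return [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]


if __name__ == "__main__":
    n = 10000
    print("aligned:", net_field(aligned_units(n)))
    print("random: ", net_field(random_units(n)))
```

With systematic orientation the contributions add in proportion to
the number of units; with random orientation the sum is typically of
the order of the square root of that number, so that most of the
individual fields cancel one another.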

A third principle facilitating the detection in a tissue of the
activity of its component cells is synchronization. This principle
has long been appreciated in the skeletal muscle, and even
more so in the nerve. The synchronized action potentials of
the individual fibers, following an electrical stimulus applied
to a large nerve, can be detected easily with an unaided string
galvanometer, but the detection of random, asynchronous activity
in the fibers of that same nerve requires the aid of amplification.
Fortunately for the electrophysiology of audition, the
principle of synchronization is automatically fulfilled to a large
extent by the nature of sound. We shall consider in Chapter
16 how each sound wave tends to set up a synchronized volley
of nerve impulses. Synchronization makes it possible to study,
with comparative ease, the activity of auditory pathways buried
deep in the substance of the midbrain, and the synchronization
of the sensory cells of the inner ear, when they generate their
tiny electric fields in response to sound waves, gives rise to a
microphonic action of the cochlea as a whole.

CONDUCTION IN NERVES 

The all-or-none law of the nerve impulse has a profound
significance for psychophysiology. All nerve impulses are
fundamentally similar, although their speed, whether measured as
velocity of conduction or as the rate of electrical change at a
given point, varies systematically from fiber to fiber as a function
of size; the rate is constant for a given fiber. Only one
kind of nerve impulse has ever been detected. It is true that
the peripheral portion of a nerve fiber dies when severed from
the cell body, and we may infer from this a trophic influence
proceeding from the cell body. This must be of the nature of
the spread of chemical material, and its velocity has been measured
by Parker and Paine in terms of centimeters per day.
Obviously this effect can bear no relation to the problem of
sensory perception.

The only other serious suggestions of conduction of
non-impulsive influences along nerve fibers are contained in the
observations on electrotonic effects described by Barron and
Matthews, in the theory of “chronaxie de subordination” of
Lapicque, and in the “retrograde influence” of Rosenblueth
and Ortiz. It is a well established fact that a difference of
electric potential established along a nerve fiber gives rise to an
extrapolar electric gradient. The normal polarization of the
axon is increased or diminished for a short distance away from
the electric source. This effect is known as electrotonus. Barron
and Matthews found that some afferent fibers in the spinal cord
show intermittent failure to conduct impulses, a transient block
to conduction, which is best explained as due to a purely physical
spread of polarization from the electrically active gray matter
of the spinal cord along collateral branches to the main axons,
there producing an ‘electrotonic block.’

The phenomenon of ‘subordination’ is apparently another
manifestation of changes in polarization which induce alterations
of excitability, velocity of conduction, and so on. There
is still some doubt as to the validity of the fundamental observations
and still more as to their interpretation, but for our
immediate argument it is significant that these changes are supposed
to be induced in one neuron by the activity of another,
and that the effects emanate from a hypothalamic center toward
the periphery. There is no suggestion that peripheral stimulation
of a sense-organ might alter the polarization or degree of
subordination of a center except by the transmission to it of
impulses, even though the theory might allow changes in sensory
threshold due to central modification of the properties of
an afferent nerve. The theory of subordination, important as
it may become for psychophysiology if finally established upon
an adequate experimental basis, must as yet be regarded with
open-minded skepticism.





The retrograde influence of Rosenblueth and Ortiz is postulated
by a process of exclusion, in order to explain certain
central effects following the cutting or blocking by local
anesthesia of motor fibers in the phrenic nerve. The experiments
make it difficult to avoid this assumption of a property
of neurons which differs from the properties manifested in the
process of conducting nerve impulses, but the effect appears
only after interruption of functional activity of the peripheral
axon, and its nature is entirely obscure.

We remain, therefore, with the proposition that the only
rapid change which proceeds centrally along an afferent nerve
fiber is the nerve impulse. This impulse is all-or-none in character,
and, although its intensity and time course may vary
within limits as a function of fatigue, these limits are definitely
established by the morphology of the fiber.

RELATION OF NERVE IMPULSES TO SENSATION 

The afferent nerve from a sense-organ, such as the eye or
the ear, represents a bottleneck through which all the variety
and gradations of sensation of the entire sense modality must
pass. The number of physiological variables in the afferent
nerve is strictly limited. There is significant freedom in the
matter of (1) how many fibers and of (2) which fibers in the
nerve may carry impulses, and of whether (3) one or several
impulses pass up each fiber and, within limits, (4) at what
frequency. There is an upper limit to the possible frequency at
about 1000 per second imposed by the recovery or refractory
period of the nerve fiber. There is no variation in the size of
the impulses as a function of the intensity of the stimulus. Such
variations in size as occur during equilibration or fatigue are a
function of frequency and of duration of activity, and hence the
variation in size is not an independent variable. There is also
freedom in (5) the temporal relations of impulses in different
fibers, since each fiber is functionally quite independent of its
neighbors.
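
The upper limit of frequency follows from simple arithmetic: if a
fiber cannot respond again until a certain period has elapsed, it
can carry at most one impulse per such period. In the sketch below
the refractory period of one millisecond is an assumption chosen
because it corresponds to the limit of about 1000 per second quoted
above; the function name is a hypothetical helper.

```python
def max_impulse_rate(refractory_period_s):
    """Largest number of impulses per second a fiber can carry if it is
    unresponsive for refractory_period_s seconds after each impulse."""
    if refractory_period_s <= 0:
        raise ValueError("refractory period must be positive")
    return 1.0 / refractory_period_s


if __name__ == "__main__":
    # Assumed refractory period of 1 msec (illustrative value).
    print(max_impulse_rate(0.001), "impulses per second")
```

A longer refractory period lowers the limit in inverse proportion;
a period of 2 msec, for example, would allow no more than 500
impulses per second in a single fiber.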

These, then, are the physiological dimensions of sensations
as they pass along the afferent nerve, and it is the task of
psychophysiology to examine the activity of the cochlea and of
the auditory nerve and to compare them with the corresponding
auditory sensations in order to ascribe, as far as possible, the
correct physiological dimensions to each dimension of auditory
sensation.

The problem of the psychophysiology of sensation is partly,
but not completely, solved by such an analysis in terms of
nerve-impulses. The function of nerve-fibers is purely that of
conduction. The message is coded, so to speak, by the sense-organ,
dispatched on the nerve-fibers, delivered promptly by them at
another station, and there decoded and related to other messages
from other stations. Our first problem is to detect the messages
in transit and learn their code. This task is relatively easy,
since we believe we know all of the possible variables in the
code and can tap the wires with fair efficiency. The next problem
is the interpretation of activity at the synaptic centers.
This is not so easy, since we do not yet know the possible variables
in the function of cell-bodies and synapses. The anatomy
is far more complicated when synapses are involved; a new set
of physiological laws confronts us and implies a new set of
physiological variables underlying them. In terms of these
new variables the sensory messages are rewritten, and, when we
tap the centers electrically, a new pattern of activity confronts
us. The centers seem to code their messages in a new language
and we are still seeking the clue to its translation.

PROPERTIES OF SYNAPSES 

The differences between synaptic or central transmission and
transmission in the nerve-trunk have been derived through
the study of spinal reflexes. They may be summarized briefly
as follows:

1. One-way conduction in the synapse. The activity passes 
from axon across synapse to dendrite or cell-body, but not in 
the reverse direction. In the nerve-trunk the impulse can travel 
equally well in either direction. 

2. Greater sensitivity of the synapse to adverse conditions.
Anesthetic drugs, fatigue, and lack of oxygen, in degrees or
concentrations easily tolerated by the nerve trunk, stop synaptic
conduction.

3. Delay in conduction through the synapse. At least 0.5
msec and sometimes as many as 2 or 3 msec are required for
transmission across the microscopic synapse, whereas the velocities
are from 1 to 100 meters per second along axons.

4. Summation of impulses at synapses. Although it is
possible that a single impulse in an axon may activate the next
neuron beyond a synaptic junction, it is usually necessary for
several impulses to arrive either (a) in succession over the same
axon, giving temporal summation, or (b) nearly simultaneously
over different axons, giving spatial summation. In the axon a
single impulse once started traverses the entire axon.

5. Inhibition at synaptic junctions. The arrival of impulses
at certain synapses does not tend to excite the next neuron, but
instead tends to suppress any activity which may be in progress
and make the neuron more resistant to excitation by impulses
arriving over other pathways. The nearest analogy to this in
the axon is the electrotonic block of impulses discussed earlier
in this chapter. Even this effect seems to depend upon the
proximity of synaptic centers.

6. Spontaneous activity of gray matter. Present evidence
suggests that certain brain cells are normally in a state of
rhythmic activity, sending a series of impulses along their axons
even in absence of stimulation by afferent impulses. This, like
many of the properties here listed, may well reside in the cell
body rather than in the synapse proper, which is the anatomical
junction between two neurons, but for our purposes there is no
need of distinguishing between the two. The contrast lies
between the conduction of impulses by peripheral axons and
the new properties introduced by axon terminations, synapses,
dendrites, and cell bodies in the gray matter of the central
nervous system. Spontaneous rhythmic discharge is not seen in the
axon except for a brief period following an acute injury, or
in an abnormal chemical environment.
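
The size of the synaptic delay listed under item 3 is easier to
appreciate when it is converted into the length of axon an impulse
would cover in the same time. The sketch below simply multiplies
velocity by time, reading the minimum delay as 0.5 msec and using
the range of axonal velocities given in the list; the function
name is a hypothetical helper.

```python
def axon_distance_mm(velocity_m_per_s, time_msec):
    """Distance in millimeters covered by an impulse in the given time.

    One meter per second equals one millimeter per millisecond, so the
    product of the two arguments is already in millimeters.
    """
    return velocity_m_per_s * time_msec


if __name__ == "__main__":
    for velocity in (1.0, 100.0):          # meters per second
        for delay in (0.5, 3.0):           # milliseconds
            mm = axon_distance_mm(velocity, delay)
            print(f"{velocity} m/s for {delay} msec = {mm} mm of axon")
```

Even the shortest delay corresponds to half a millimeter of the
slowest axon and to some 50 millimeters of the fastest, distances
very large in comparison with the microscopic width of the synapse.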

The modifications of conduction introduced by the synapses
or cell bodies may well be related to the high rate of metabolism
of nerve-centers, as compared with the greater economy of the
fibers, and to the probable chemical mode of transmission between
neurons as opposed to the electrochemical mechanism of
the impulse. At present we are still seeking to understand the
nature of central action and are not yet prepared to undertake
psychophysiological correlations with either chemical or electrical
events occurring in the gray matter of the nervous system.



CHAPTER 13 


THE MICROPHONIC ACTION OF THE
COCHLEA

The electrical activity of the cochlea has been employed as a
tool in the analysis of both physical and physiological problems.
The original impetus to this type of work was given by Wever
and Bray (1, 2) when they reported that, after placing electrodes
upon the medulla or the auditory nerve of a decerebrate cat,
they were able, by listening with telephone receivers to the
amplified signals, to recognize, not only pure tones used as
stimuli, but even words spoken to the cat. This observation
immediately attracted the attention of both physiologists and
psychologists, and its verification and extension were undertaken
in several laboratories.

HISTORICAL 

The presence of action potentials in the eighth nerve and 
the brain stem was taken for granted even before it was directly 
demonstrated by Buytendijk in 1910. Buytendijk, in that 
year, with merely a string galvanometer, recorded the 
action-current of the auditory nerves of rabbit and guinea pig in 
response to a pistol shot. Seventeen years later, Forbes, Miller, 
and O’Connor, using a string galvanometer and one stage of 
amplification, detected responses from the medulla of the 
decerebrate cat in response to sudden sounds. Synchronization 
with rapidly repeated clicks was observed up to 200 per second, 
but the limitations of the recording device prevented analysis 
at higher frequencies. 

Then Wever and Bray in 1930 reported the reproduction 
of speech in the auditory nerve. They proved that the effect 
was truly biological and not a mere physical artefact. In their 
experiment we also encounter the first detection of the 
microphonic action of the cochlea, a true microphonic action, 
different from the action potentials of the auditory nerve. Wever 
and Bray did not suspect the dual nature of their electrical 
phenomena, however, and interpreted them exclusively in terms 
of action potentials. Adrian suggested a “microphonic action 
of the cochlea” and gave considerable experimental evidence in 
support of this view, but in collaboration with Bronk and 
Phillips he apparently reversed his judgment, and tacitly 
assumed the existence of only one type of response: true action 
potentials. Saul and Davis pointed out the distinction between 
the two types of response by showing that one of them (the 
action potential) is limited to relatively low frequencies, is easily 
suppressed by anesthetics, and is localized in the auditory 
pathways and nuclei of the medulla, whereas the other (the aural 
microphonic) occurs at high as well as low frequencies, is 
resistant to anesthesia and death, and is generated in or near 
the cochlea. Meanwhile Wever and Bray (4, 5) extended 
their studies to other animal forms. All mammals studied 
showed effects very similar to those shown by the original cats. 
Turtles reacted similarly to low frequencies, but were 
unresponsive to high tones, whereas insects gave only asynchronous 
nerve impulses (see, however, p. 399). 

THE ELECTRICAL ACTIVITY OF THE COCHLEA 

The microphonic action of the cochlea, or cochlear response, 
as it has previously been called, denotes the generation in the 
cochlea of electric potentials when sound waves activate the 
ear. These potentials reproduce the frequency and the wave 
form of the sound waves throughout the range of audible 
frequencies, and their latency with respect to the sound wave is 
less than 0.1 msec, perhaps much less. They are not to be 
confused with the action potentials of the auditory nerve, which, 
as we shall see, show synchronized response to only a restricted 
range of lower frequencies. The wave form of the action 
potentials may differ considerably from that of the stimulating 
sound, and their latency is at least 0.7 msec. Action potentials 
fall in the all-or-none, self-propagating category of disturbance 
described in Chapter 12, whereas the microphonic action of the 
cochlea is apparently a distortion potential and represents the 
transformation of the mechanical energy of the sound wave 
into electricity without further contribution of energy by the 
tissue. It is to emphasize this distinction that we here employ 
Adrian’s phrase ‘the microphonic action of the cochlea’ rather 
than ‘cochlear response,’ since ‘response’ has acquired a 
physiological connotation suggesting the liberation of energy in some 
specific manner, as in the nerve impulse or muscular 
contraction. Another apt designation of this effect is cochlear or aural 
microphonic. 

There is no doubt that the microphonic action occurs in the 
cochlea, although it may be detected from almost any part of 
the head when sufficient amplification is used. The signals 
are strongest, however, if the petrous bone is included between 
the two electrodes, and best of all when one of the electrodes 
is placed in contact with the round window, or, in the case of 
the guinea pig, with the apex of the cochlea. Contact can be 
made to one of these points with a cotton wick moistened in 
salt solution, and the circuit completed through an indifferent 
electrode on the muscles of the back of the neck. The potential, 
as measured at the round window, may amount to a maximum 
of approximately 1 millivolt when the ear is stimulated by a 
loud pure tone. If the tympanic membrane is damaged, or 
the ossicular chain interrupted, the microphonic action is 
diminished, but it can still be obtained when the sound waves are 
carried to the cochlea by bone-conduction. 

Post-mortem Activity. When the experimental animal is 
more and more deeply anesthetized, or when it dies during the 
experiment, the microphonic action is only slightly affected 
until the circulation fails. The potentials then fall to a low 
level. Ligation of the carotids and compression of the vertebral 
artery causes a similar reduction to a value of from 5 to 20 per 
cent of the original strength within 2 or 3 minutes. 
Occasionally, with failure of the circulation, the potentials almost 
completely disappear within a very short time. More frequently, 
however, they persist at a low level, continuing to fall slowly 
for from one to several hours. The exact moment of final 
extinction is difficult to determine, for it depends on the degree of 
amplification and the strength of stimuli employed. So far 
as it has been investigated, this post-mortem activity appears 
not to differ in any significant way, other than in magnitude, 
from the normal ante-mortem activity. If the depression has 
been brought about by compression of arteries, the effect is 
reversible: the microphonic action returns upon re-admission 
of the circulation. These facts assure us that the piezoelectric 
effect in the cochlea does not depend simply upon frictional 
effects of endolymph moving in its channels or upon the 
vibration of ossicles and membranes. Cooling the cochlea by 
placing ice in the bulla, or on the petrous bone, causes a similar 
partial depression without significant change in the upper limit 
of frequency. This is another important point differentiating 
the microphonic effect of the cochlea from the action potentials 
of the auditory nerve. 

THRESHOLD OF THE COCHLEAR MICROPHONICS 

If, as appears probable, the cochlear microphonic is a true 
piezoelectric effect in which the potential generated is 
proportional to the distorting force applied to the hair cell, we should 
expect to find no ‘threshold’ for this effect in the sense that we 
find a threshold for the excitation of a nerve fiber. A fiber 
reacts in an all-or-none fashion as soon as a stimulus achieves 
a certain finite value, the threshold value. The cochlear 
microphonic, however, is a continuous function of the intensity 
of the stimulus. Therefore, the lowest value of the stimulus 
which will generate a cochlear microphonic is determined only 
by the ultimate quantal nature of electricity. With available 
techniques we cannot, of course, follow the microphonic effect 
to these very small values, and so we designate as ‘threshold’ 
the lowest values which we can conveniently measure. 
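
The contrast between a true threshold and an instrumental one can be sketched in a few lines. This is an illustration only: the function names, the unit threshold assigned to the fiber, the gain assigned to the microphonic, and the two 'floors' standing for instruments of different resolving power are all assumptions, not values taken from the measurements described here.

```python
def fiber_response(stimulus, threshold=1.0, spike=1.0):
    """All-or-none: a full-sized response once the threshold is reached, else nothing."""
    return spike if stimulus >= threshold else 0.0


def microphonic_response(stimulus, gain=0.5):
    """Continuous: output proportional to the stimulus (low-intensity range only)."""
    return gain * stimulus


def measured_threshold(stimuli, response, instrument_floor):
    """Smallest tested stimulus whose response reaches the instrument's floor."""
    for s in sorted(stimuli):
        if response(s) >= instrument_floor:
            return s
    return None


# Hypothetical stimulus strengths in arbitrary units.
stimuli = [0.01, 0.1, 0.5, 1.0, 2.0, 4.0]

for floor in (1.0, 0.04):  # a coarse and a fine instrument (assumed values)
    print(
        "floor", floor,
        "fiber:", measured_threshold(stimuli, fiber_response, floor),
        "microphonic:", measured_threshold(stimuli, microphonic_response, floor),
    )
```

Under these assumptions the fiber's threshold stays where it is when the instrument is improved, whereas the 'threshold' of the proportional response moves down with the resolving power of the instrument, which is the sense in which the term is used for the microphonic.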

The relation of a just-detectable microphonic effect to the 
energy and frequency of the stimulating sound resembles rather 
closely the human audibility curve, particularly when the latter 
is determined by the same stimulating apparatus (Fig. 123). 
A ‘just-detectable effect’ is, in practice, about 1 microvolt when 
a cathode-ray oscillograph is employed as indicator, or a little 
less when measured with a sharply tuned circuit, such as a wave- 
analyzer. Such a ‘threshold effect’ is produced in guinea-pigs 
by the least sound-energy when the frequency lies between 700 
and 2000 cycles. The sensitivity is less for both higher and 



Fig. 123. Average threshold-curves from 17 normal guinea pigs and 8 
human ears, plotted against frequency. The threshold for the cochlear 
microphonic was taken as the electrical input to the loud speaker which 
yielded a just-visible wave of about 1 microvolt on a cathode-ray 
oscillograph. The human ears were tested by placing the speculum through 
which sound was delivered to the ear of the guinea pig into the observer’s 
external meatus and requiring the observer to report the presence or 
absence of the tone. Only observers under 30 years of age were used. 
(Stevens, Davis, and Lurie.) 

lower frequencies, by an amount depending on what part of 
the cochlea is nearest to the recording electrode. The signif- 
icance of this fact will appear later. The absolute sensitivity of 
the animal appears to be a little less than that of the human 
observer, but it should be remembered that the animal’s 
‘threshold’ is here taken as an arbitrary potential, dependent upon 
the resolving power of an amplifier. This just-detectable effect 
may well be above the threshold for activation of the most 
sensitive nerve fibers, since Kemp, Coppee, and Robinson frequently 
observed a threshold for action potentials in the medulla some 
10 db below the round-window ‘threshold’ which was measured 
simultaneously in the same animal. With this correction, the 
threshold functions for animals and human observers would 
coincide very closely indeed. In fact, the electrical audiogram 
of the most sensitive animal preparations is nearly 
superimposable on the average curve for the normal human ear. The 
systematic irregularities at 250 and 700 cycles in the curve for 
the guinea pig in Fig. 123 are almost certainly due to 
interference by action potentials, a fact to be considered later. 

LIMITS OF FREQUENCY 

At very high and at very low frequencies, more and more 
sound energy is required to produce a detectable microphonic 
effect. This makes it impossible to speak with precision about 
upper and lower limits of frequency. As better sound systems 
and better recording amplifiers have been employed, the 
reported upper limit of response has risen from 4100 cycles, 
originally mentioned by Wever and Bray, to more than 16,000 
cycles. Here the limit still appears to be fundamentally 
instrumental, since the voltage of the microphonic is small and high 
sound intensities are necessary. It seems quite reasonable to 
assume that the microphonic persists at least as far as the upper 
limit of hearing. 

The lower limit presents a similar problem, for the necessary 
intensity of sound rises progressively as the frequency is reduced. 
Figure 123 shows, however, that if the electric potential is 
measured at the apex instead of at the round window the 
threshold curve rises more slowly at low frequencies. Very 
slow waves of less than 1 per second are often seen to be 
correlated with the pressure changes produced by movements of 
the door of the experimental room, and also by contraction and 
relaxation of the intra-aural muscles (H. C. Wiggers). In 
their investigation of the effects of low tones, Wever, Bray, and 
Willey demonstrated cochlear microphonics in response to 
frequencies as low as 5 cycles. Ultimately, however, a limit must 
be reached when pressure differences between the scala vestibuli 
and the scala tympani are equalized through the helicotrema 
as fast as they are produced. Nevertheless, it is perfectly clear 
that the lower limit of the microphonic is far below the 20 cycles 
which is conventionally assigned as the lower limit of pitch 
perception. 


WAVE FORM 

The wave form of the cochlear potential corresponds fairly 
closely to that of the stimulating sound wave. The complex 
waves of human speech are reproduced accurately enough to 
allow listeners to recognize the speaker by the quality of his 
voice. Although a pure sinusoidal sound wave is reproduced 
under favorable circumstances as a sinusoidal electric wave, 
there are two major limitations to the perfection of the 
reproduction. In the first place, the intensity of stimulation must not 
be too great, else higher harmonics will distort the wave, as 
shown in Fig. 124. This distortion undoubtedly represents the 
‘subjective’ or, better, aural harmonics which have long been 
familiar to psychologists and which have been considered in 
Chapter 7. In the second place, we often see at frequencies 
below 1000 cycles a notch or hump in the main wave (Fig. 
124). This latter distortion can also be measured in terms of 
harmonics, mostly second and fourth. Its basis is not, 
however, a mechanical nonlinear distortion, but merely the presence 
of the action potential of the auditory nerve. The nerve fibers 
discharge more or less synchronously once during each cycle, 
and an electrode on the round window or apex records their 
action potentials simultaneously with the microphonic wave. 

There is a constant latency of approximately 0.7 msec 
between the arrival of a sound wave at the basilar membrane and 
the appearance of the corresponding action potential in the 
nerve. Consequently, as the frequency is changed, the hump 
due to the action potential shifts phase relative to the 
microphonic wave. At frequencies near 1000 cycles the action 
potential may merge quite smoothly with the succeeding 
microphonic wave and be scarcely noticeable. At higher frequencies, 
the action-potentials recorded from the auditory nerve become 
much smaller and finally asynchronous (see Chapter 17), so 
that the distortion of the wave due to this cause becomes 
negligible. It is plain, then, that the admixture of action-potential 
interferes with exact measurement of the cochlear microphonics 
at certain frequencies, but, with this limitation, it seems safe 
to conclude that the cochlear microphonic reflects with great 
accuracy the time-course and wave-form of the mechanical 
disturbance within the cochlea. 

Fig. 124. Standing-wave oscillograms recorded from the round window of 
a guinea pig, and (lower right) a control oscillogram of sound waves recorded 
by a crystal microphone. The sweep-circuit of a cathode-ray oscillograph is 
synchronized with the sound waves. The luminous spot then traverses the 
same path repeatedly and produces a standing-wave pattern which is 
photographed. 

The 1500-cycle record shows a nearly sinusoidal wave-form. At 550 cycles 
the action potential appears as a notch at the peak of the microphonic wave. 
At 300 cycles and at 240 cycles the action potential stands in a different 
phase-relation to the microphonic. The cochlear microphonic is nearly maximal in 
the first four records. In the fifth (240 cycles at 0 db) the sound intensity is 
supramaximal and the cochlear microphonics show the strong aural harmonics 
which are introduced by nonlinear distortion in the middle and inner ear. 
The lower right-hand record, taken by a crystal microphone, shows the purity 
[...] ([...] db above human threshold at 1000 cycles.) 
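
The shift of the action-potential hump along the microphonic wave follows from simple arithmetic on the constant latency. The sketch below assumes the latency of 0.7 msec given in the text and treats it as exactly constant; the function names are illustrative, and the position of the hump in an actual record will also depend on the phase of the microphonic itself, which is not modelled here.

```python
LATENCY_S = 0.7e-3  # latency of the action potential, about 0.7 msec (from the text)


def lag_in_cycles(frequency_hz, latency_s=LATENCY_S):
    """Latency expressed as a fraction (or multiple) of one period of the tone."""
    return frequency_hz * latency_s


def phase_lag_degrees(frequency_hz, latency_s=LATENCY_S):
    """The same lag as a phase angle, reduced to the range 0-360 degrees."""
    return (360.0 * frequency_hz * latency_s) % 360.0


# Frequencies of the records discussed in connection with Fig. 124.
for f in (240, 300, 550, 1000):
    print(f, "cycles:", round(lag_in_cycles(f), 3), "cycle,",
          round(phase_lag_degrees(f), 1), "degrees")
```

A fixed delay is thus a small fraction of a cycle for low tones and a large fraction near 1000 cycles (0.7 of a cycle under the stated assumption), so the hump falls progressively later in the wave as the frequency is raised.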








POLARITY AND PHASE RELATIONS 

The polarity of the microphonic wave depends upon the 
location of the exploring electrodes. If we record the 
potential between the round window, which is electrically continuous 
with the scala tympani, and an indifferent electrode on the 
neck, we find that the development of positive external pressure 
on the tympanic membrane causes the round window to become 
electrically negative (cf. Fig. 133, p. 342). Negative pressure 
causes it to become electrically positive. It is of some 
theoretical importance that the initial electrical change may be either 
positive or negative from the resting potential, and is not 
restricted to a single polarity, as in the case of the nerve impulses. 

As the round window becomes negative, the oval window 
and stapes, electrically continuous with the scala vestibuli and 
scala media, become electrically positive, and, in the guinea pig, 
the apex of the cochlea, which is easily accessible and is separated 
from the scala vestibuli and scala media by only a thin shell of 
bone, also becomes more positive. The opposite sign of the 
electric potential at apex and at round window implies that 
the two sides of the basilar membrane develop opposite electric 
charges in response to mechanical pressure on the tympanic 
membrane. 

Efforts to determine more accurately the phase relation 
between the potentials at round window and apex showed that 
it shifted somewhat as a function of frequency (Stevens and 
Davis). At 60 cycles the two potentials were almost 180° out 
of phase, but the difference approached 90° as the frequency 
was increased to 4000 cycles. The individual measurements 
varied considerably, however, under the method used. The 
interpretation of these phase-differences is uncertain, 
particularly when the differences are other than 180°. The presence 
of a complex electric impedance of the sort embodied in all 
living tissue (Stevens and Davis) may contribute to them, or 
they may also be an expression of actual differences of phase 
in the mechanical events toward the two ends of the basilar 
membrane (cf. Fig. 118, p. 285). 





RELATION TO SOUND INTENSITY 


The voltage of the cochlear potential increases as a 
continuous function of the intensity of the stimulating sound. No 
evidence has appeared to indicate any step-like additions of 
all-or-none units, as in neuromuscular activity. At low intensities, 
provided disturbing factors are eliminated, the potential is 


Fig. 125. The voltage of the cochlear microphonic is plotted as a function 
of intensity in linear and also in double logarithmic coordinates. The units 
of the linear scales are arbitrary. Corresponding points on the two curves 
are connected by broken lines to show what a small part of the upper curve 
is represented by the long straight portion of the lower curve. These 
microphonics were obtained from the round window of a guinea pig and measured 
with a sharply tuned wave-analyzer. (Data from Newman, Stevens, and 
Davis.) 


directly proportional to the amplitude of the stimulus. At 
higher intensities the curve becomes concave toward the 
intensity axis, as shown by the upper curve of Fig. 125. The cochlear 
potential later passes through a maximum, and, at very high 
intensities, declines markedly. The deviation from the linear 
relationship usually occurs when the cochlear potential has 
reached about 20 per cent of its ultimate maximum. When the 
voltage of the cochlear microphonic is plotted against the 
logarithm of the sound intensity (decibel scale), the function 
appears as a sigmoid curve, as in Fig. 129. The data may also 
be plotted as the logarithm of the microphonic against the 
logarithm of the sound-intensity. The linear relationship at 
low intensities then appears as a straight line with a slope of 
1.0, as shown by the lower curve in Fig. 125. This form of 



Fig. 126. Variations in the relation between voltage of the cochlear 
microphonic and sound intensity. Double logarithmic coordinates. 
A: from the apex of a guinea pig’s cochlea. Frequency 1500 cycles. This 
slope of 1.00 at low intensities is the ideal case. 

B: from the apex of a guinea pig’s cochlea. Frequency 1500 cycles. 
C: from the round window of a guinea pig. Frequency 1000 cycles. 
Irregular curve due to interference by action potentials. 
The data for curves A, B, and C were obtained by means of a sharply tuned 
wave-analyzer. (Stevens and Davis, unpublished.) 

D: from the round window of a pigeon. Frequency 5000 cycles. (Data 
from Wever and Bray, 8.) 

Curves A, B, and C are shown in the correct relation to one another with 
respect to intensity and to voltage. Curve D is placed arbitrarily in what is 
judged to be its proper relation to the other three curves. 

representation has been widely used, but it should be noted 
that, owing to its logarithmic character, it exaggerates the 
importance of the small potentials at low intensities. It is for 
just these small potentials that measurements are most uncertain. 
The slope of the straight portion of the curve in the double 
logarithmic plot seems to approach 1.0 as a limiting value. 
Deviations occur (Fig. 126) in the direction of a lesser slope, 
as low as 0.625 in the guinea pig (Stevens and Davis, 
unpublished) and as low as 0.35 in the pigeon (Wever and Bray, 8). 
More rarely the slope is significantly greater than unity. The 



Fig. 127. Typical family of voltage-intensity curves obtained at various 
frequencies (parameters) from the round window of a guinea pig. The 
voltages were measured with a sharply tuned wave-analyzer. Most of the 
curves show slopes of less than 1.00, and many of them are slightly sigmoid. 
Curve C in Fig. 126 belongs to this same family. (Stevens and Davis, 
unpublished.) 

slope of unity appears to represent the ideal case. The 
straightness of the line is also a limiting or ideal case. Very often there 
is a systematic deviation in the direction of a long flat sigmoid, 
as shown in Fig. 127. Sometimes deviations are abrupt, 
significant, and reproducible, as illustrated by curve C in Fig. 126. 
Such abrupt deviations are almost always associated with a 
distorted wave form, revealed by the cathode-ray oscillograph. 
The distortions in question appear at low intensities, far below 
the level at which mechanical nonlinear distortion might 
reasonably occur, and several tests prove that they are due to the 
presence of an action-potential component. No ready means 
has yet been devised for eliminating these action potentials, 
although partial asphyxia and cooling may reduce them 
considerably. The action potentials are not sinusoidal, and, 
depending on their phase relation to the cochlear potential proper, 
they may add to or subtract from the latter. This is true 
whether we measure peak voltage on the cathode-ray 
oscillograph or the root-mean-square alternating voltage of the 
fundamental frequency, with a tuned frequency analyzer or filter 
system. The action potentials also introduce even harmonics 
which may be measured by an appropriate analyzer. 
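
The slope in double logarithmic coordinates discussed above can be computed by an ordinary least-squares fit. The following is a minimal sketch under stated assumptions: the intensity scale is taken to be sound pressure expressed in decibels (so that 20 db corresponds to a tenfold change of amplitude), the function name is illustrative, and both data series are synthetic, generated from exact power laws rather than taken from any of the figures.

```python
import math


def loglog_slope(intensity_db, voltages):
    """Least-squares slope of log10(voltage) against log10(stimulus amplitude).

    Assumes the decibel values refer to sound pressure, so that
    log10(amplitude ratio) = db / 20.
    """
    x = [db / 20.0 for db in intensity_db]
    y = [math.log10(v) for v in voltages]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx


# Synthetic low-intensity data in arbitrary units (not measurements).
levels_db = [0, 10, 20, 30, 40]
proportional = [10 ** (db / 20.0) for db in levels_db]        # voltage ~ amplitude
compressed = [10 ** (0.625 * db / 20.0) for db in levels_db]  # voltage ~ amplitude**0.625

print(round(loglog_slope(levels_db, proportional), 3))
print(round(loglog_slope(levels_db, compressed), 3))
```

A strictly proportional series returns the ideal slope of unity, while a compressive power law returns its exponent; real records, disturbed by action potentials, would scatter about such a line rather than lie upon it.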

We shall see (Chapter 16) that the law of increase of the 
action potential may be approximately, but not exactly, the 
same as that of the cochlear potential. Addition of nerve 
potential to cochlear potential, therefore, should, and 
apparently does, introduce more or less systematic deviations from 
any simple law. 

Another physiological factor may mask the simple relation 
of cochlear microphonic to sound intensity. The muscles of 
the middle ear contract reflexly in response to sounds, and the 
louder the sound the stronger the contraction (Lorente de Nó, 
1). Now, increased tension of these intra-aural muscles 
reduces the transmission of sounds. The reduction is very 
significant for low tones, but absent for frequencies above 2000 cycles 
(see Fig. 110, p. 267). For low tones, therefore, there is a 
reduction of the cochlear microphonics which is a function of 
the intensity of the stimulus. This fact tends to make the 
microphonic curve concave toward the axis of intensity. The 
reflex in question is depressed by deep anesthesia (Hallpike, 
2) and in consequence has, apparently, escaped the attention of 
several investigators, but it must be considered in any study of 
the unanesthetized animal or normal human being. 

RELATION OF MAXIMAL COCHLEAR POTENTIAL 
TO FREQUENCY 

At low intensities of stimulation the magnitude of the 
cochlear potential increases in linear relation to the intensity 
of the stimulus. This does not continue indefinitely, for with 
stronger stimuli the voltage of the cochlear microphonic in- 
creases more and more slowly and ultimately passes through a 
maximum. When the intensity of the stimulus is plotted on 
a logarithmic (decibel) scale, this maximum is rather sharp 
(Fig. 129). Plotted on an arithmetical scale, however, it ap- 
pears as a long plateau (Fig. 125). 

The intensity at which the maximal potential is developed 
can be determined for each frequency. This intensity does 
not vary greatly as a function of frequency and is some 90 db 
above the threshold of the microphonic at 1000 cycles. The 
sound-intensity which produces maximal potential is of the 



Fig. 128. The maximal voltage obtained from the round window of the 
cat (average of 12 ears) is plotted, as a function of frequency, against a linear 
scale. The sound intensity necessary to attain maximal voltage is plotted in 
decibels (after Covell and Black). The threshold of feeling in man is also 
plotted in decibels for comparison (Wegel, 2). 

same order of magnitude as, although slightly lower than, the 
intensity at the threshold of feeling in human ears (see Fig. 128). 

The maximal voltage obtainable from the cochlea is a func- 
tion of the frequency of stimulation. Less voltage is obtainable 
at high frequencies than at low frequencies, but the exact form 
of this relationship varies with the position of the recording 
electrode. If the active electrode is placed in contact with the 
round window, the greatest voltage can be obtained at about 
800 cycles in the guinea-pig. Above this optimal frequency, 
the maximal voltage falls off in almost linear relation to the 
logarithm of the frequency, as shown in Fig. 128, until at 
10,000 cycles the voltage is not more than 20 per cent of the 
voltage at 1000 cycles (Covell and Black). In some experiments 
the voltage falls off much more rapidly than is indicated in Fig. 
128. At low frequencies the maximal potential also falls 
off with frequency, but much more slowly. The maximal 
voltage usually obtained at the optimal frequency of 800 or 1000 
cycles is approximately 1 millivolt, although Bast and Eyster 
report a maximum as high as 2.5 millivolts. 
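
The figures just quoted can be turned into a rough working rule. The sketch below is an idealization, not the authors' procedure: it assumes that the fall is exactly linear against the logarithm of the frequency between 1000 and 10,000 cycles and that the voltage at 10,000 cycles is exactly 20 per cent of that at 1000 cycles, whereas the text gives 20 per cent only as an upper bound and the fall only as 'almost' linear. The function names are illustrative.

```python
import math


def ratio_to_db(voltage_ratio):
    """Express a voltage ratio in decibels."""
    return 20.0 * math.log10(voltage_ratio)


def relative_max_voltage(frequency_hz, f_ref=1000.0, f_high=10000.0,
                         fraction_at_high=0.2):
    """Maximal voltage relative to its value at f_ref.

    Assumes a strictly linear fall against log frequency from 1.0 at f_ref
    to fraction_at_high at f_high (an idealization of Fig. 128).
    """
    if not f_ref <= frequency_hz <= f_high:
        raise ValueError("the sketch covers only the range f_ref to f_high")
    position = math.log10(frequency_hz / f_ref) / math.log10(f_high / f_ref)
    return 1.0 - (1.0 - fraction_at_high) * position


for f in (1000, 2000, 5000, 10000):
    fraction = relative_max_voltage(f)
    print(f, "cycles:", round(fraction, 2), "of the 1000-cycle value,",
          round(ratio_to_db(fraction), 1), "db")
```

On these assumptions a reduction to 20 per cent corresponds to a loss of about 14 db over the decade from 1000 to 10,000 cycles.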

When the active electrode is placed upon the apex, the same 
type of relation is found, but the optimal frequency is 
considerably lower than at the round window. In fact, in some 
guinea pigs the optimal frequency is at 100 cycles or even less. 
In any case, this maximal cochlear potential is very nearly the 
same over a wide range of frequencies below 300 cycles. Above 
300 cycles the microphonic at the apex falls off even more steeply 
than it does at the round window. 

The measurements of maximal cochlear potential at low 
frequencies are less precise and reproducible than those at high 
frequencies because of the distortions of wave form illustrated 
in Fig. 124. The distortion, which is particularly evident with 
low tones, is due to the appearance of higher harmonics in 
the microphonic wave and the admixture of large components 
originating as action potentials in the auditory nerve. In spite 
of the uncertainties of measurement which these factors 
introduce, it is quite clear that the optimal frequency at the apex 
is lower than that at the round window and that the maximal 
potential which can be obtained at the optimal frequency at 
the apex is greater than that obtainable at the round window. 
The fall of maximal potential with increasing frequency may 
be due to two possible factors. It may simply be that the 
potential generated by the hair cells at the high frequencies is 
actually smaller. Or it may be that the potential across the hair 
cells is as large at high as at low frequencies, but that the 
potential does not readily make itself felt at the electrodes. This 
inefficiency is presumably due to an increase of electrical 
shunting by the tissues of the ear and head for alternating 
currents of higher and higher frequency. Direct measurements 
have shown that the electrical impedance of tissues falls with an 
increase of frequency (Stevens and Davis), and the shunting 
action on the hair-cells may be expected to increase accordingly. 
The importance of this and other possible factors cannot be 
estimated from available data. 

It should be noted here that the values for the maximal 
voltage obtainable from the cochlea discussed above were obtained 
with one electrode on the round window or the apex and the 
other elsewhere on the body of the animal. When one 
electrode is on the round window of a guinea pig and the other is 
placed on the apex instead of on the body, the maximal voltage 
may be either increased or decreased, depending on the 
frequency of the stimulus (Stevens and Davis). 

OVERLOAD HYSTERESIS AND FATIGUE 

It is evident from Chapter 8 that the ear, regarded as a 
mechanical system, is at first linear in relation to the intensity 
of the activating tone, and becomes nonlinear when about 20 
per cent of the ultimate maximum of mechanical and electric 
response is reached. In a technical sense, this deviation from a 
linear relation represents an ‘overload’ of the system. From 
the physiological point of view, however, overload is not reached 
until the cochlear potential passes through its maximum. We 
have already pointed out that, when the intensity of 
stimulation is increased beyond the point at which maximal voltage is 
reached, the electric output will begin to fall off again. The 
diminution is a function of the intensity of the stimulation and 
the length of time which it continues. With submaximal 
stimulation there is no suggestion of fatigue, that is to say, 
no diminution of activity from prolonged stimulation. 
Stimulation which is 40 db supramaximal, however, may reduce 
the cochlear potential to 30 or 40 per cent of its original maximal 
value. When, after such depression, the intensity of stimulation 
is diminished step by step and the corresponding cochlear 
potentials are measured, the original curve is not retraced (see Fig. 
129). The points follow a new and somewhat lower curve. 
The threshold is also found to be somewhat elevated, and a 
period of recovery is necessary before the original threshold 
and the original voltages for a given intensity can be obtained 
again. The period of time necessary for recovery from such 
an overload depends upon the severity of the depression. If 
the depression is slight, a few minutes’ recovery may suffice; 



Fig. 129. The size of the cochlear potential as a function of the intensity 
of the stimulus. The abscissa represents an arbitrary intensity scale (in db) 
against which is plotted the size of the cochlear potential as it appears on the 
oscillograph. The arrows indicate the direction of intensity variation for the three 
different hysteresis loops. The intensity was changed by 10 db at 20-second 
intervals, and curves 1, 2, and 3 were traced in that order. As the response 
went through the maximum, the wave form became highly complex. (Stevens 
and Davis.) 

but, if it is severe, it may be a matter of hours before the original 
sensitivity is restored. To this phenomenon we may apply the 
descriptive term hysteresis, because it emphasizes an analogy 
*£i, t jK&ssss$^ «/«&. *3ait aoii 

zation of iron. The hair-cells and iron are similar in that they 
must both be restored to their original state before a previous 
t response-curve can be retraced. The curves of Fig. 129 may 
aptly be compared to hysteresis loops. 
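The decibel figures quoted for overload can be turned into plain ratios with a few lines of arithmetic. The following is only a sketch, assuming the usual convention that a difference of 20 db corresponds to a tenfold ratio of sound pressure (or of voltage); the function names are illustrative and do not come from the text.

```python
import math


def db_to_pressure_ratio(db):
    """Pressure (or voltage) ratio corresponding to a level difference in db."""
    return 10.0 ** (db / 20.0)


def fraction_to_db(fraction):
    """Express a potential, given as a fraction of its original value, in db."""
    return 20.0 * math.log10(fraction)


# Stimulation 40 db beyond the intensity that gives maximal voltage.
overload_ratio = db_to_pressure_ratio(40.0)

# A cochlear potential reduced to 30 or 40 per cent of its maximal value.
depression_30 = fraction_to_db(0.30)
depression_40 = fraction_to_db(0.40)

print("pressure ratio for 40 db:", overload_ratio)
print("depression in db:", depression_30, depression_40)
```

On this reckoning a stimulus 40 db supramaximal has one hundred times the sound pressure needed for maximal voltage, and a potential depressed to 30 or 40 per cent of its maximum has fallen by roughly 8 to 10 db.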

The depression following supramaximal stimulation was described
by Hughson and Witting and was termed by them auditory
fatigue. If by ‘fatigue’ we mean all depression of response
as a result of previous activity, the term is obviously appropriate,
but it seems that the process is more akin to a temporary
pathological damage of the responding mechanism, since it may
require hours or days for recovery. It should be observed that this
effect does not appear except at intensities considerably beyond
the intensity required to obtain maximal voltage from the ear.
We may therefore refer more aptly to the phenomenon as
overload.

The disruptive effects of still more violent stimulation, and
the degenerative changes induced by long-continued stimulation,
will be described in Chapter 15, and the fact that such
strong stimulation causes temporary or permanent damage to
the hair-cells makes it reasonable to attribute the overload and
hysteresis effects also to the hair-cells. This is in accord with
the view that the hair-cells are the structures which generate
the microphonic potentials of the cochlea (Chapter 14).

IMPULSIVE STIMULI 

Our description of the microphonic action of the cochlea
has so far been based upon stimulation by pure tones, which
are either steady or at least alter gradually, as in the modulations
of the human voice. The proposition that the cochlear
microphonics reproduce accurately the pattern of the sound waves
impinging on the ear must be qualified when the stimulus is
a very brief train of waves, such as the tick of a watch, and also
when we consider the abrupt onset and cessation of pure tones.

Impulsive stimuli in the form of sharp ‘clicks’ may be
conveniently generated by discharging a condenser through a loud
speaker. The quality of the click may be altered by the physical
characteristics of the loud speaker employed. Condenser
microphone and cathode-ray oscillograph show that, depending
on the type of loud speaker employed, the physical disturbance,
in terms of sound waves, is a train of waves whose frequency
is from 2000 to 15,000 cycles or more, beginning abruptly but
falling off very rapidly in intensity (Fig. 130 B). The cochlear
microphonic, and the succeeding action potential, which is
generated by such a click is shown in Fig. 130 A. It is a series of
one to three or four waves, each 0.5 to 0.7 msec in duration,
which rapidly decline in amplitude. The pattern corresponds
quite closely to the pattern of vibration of the ossicles in response
to the sharp sound generated by an electric spark (Fig. 109,
p. 263).

The pattern shown in Fig. 130 A is not due entirely to the
cochlear microphonics but contains action potentials as well.
The situation is exactly analogous to that of the pure tone
illustrated in Fig. 124, in which the action potential appears
as a hump on the otherwise sinusoidal wave of the cochlear
microphonic. With impulsive stimuli, however, the action
potential can be more clearly differentiated, as we shall see in
more detail in Chapter 16 (cf. particularly Fig. 147). The action
potentials do not appear until at least 0.6 msec after the first
wave of the cochlear microphonics. The first cycle of the
pattern in Fig. 130 A is exclusively cochlear microphonics and
can legitimately be compared with the pattern of mechanical
vibration shown in Fig. 109 (p. 263). The later waves of
Fig. 130 A are a complex mixture of microphonic and action
potential. Fig. 130 C, taken after the death of the animal,
represents pure cochlear microphonics and shows in the
electrical record the natural period of the ear as a whole.

The details of wave length, wave form, abruptness of onset,
etc., of the electrical pattern depend upon the pattern of the
train of impinging sound waves. These minor differences are
reflected, in turn, in the pattern of nerve-impulses which they
generate. This corresponds to the obvious psychological fact
that we can detect differences in the tonal quality of the clicks
produced by different instruments.

Fig. 130. Oscillograms of the cochlear microphonics in response to clicks.
The time scale is in milliseconds.
A. From the round window of a cat in response to a dull click. The pattern
is a composite of microphonic and action potential.
B. Sound-wave pattern of the click recorded through a condenser microphone.
C. Post-mortem cochlear microphonic from the round window. The stimulus
is the same as for A except that its intensity is greater. The action
potential component has disappeared. (Davis, Derbyshire, Lurie, and Saul.)

On-Effect. The record of the cochlear microphonic generated
by the sudden onset of a strong pure tone is complicated
(Fig. 131 B). Even when we arrange to start the stimulus
without a gross physical transient, to avoid the production of
echoes, and to anesthetize the animal so deeply that the muscles
of the middle ear give no reflex contraction, even with all
these precautions, the onset of electrical activity is not smooth
and simple. A pattern very closely resembling that resulting
from a single isolated click initiates the activity. When the
stimulating tone is very strong, the on-effect may be several
milliseconds in duration. The size and exact pattern of the on-effect
depend somewhat on the phase of starting the tone, that is to say,
whether the electric current to the loud speaker is closed when
the electric wave is near peak voltage or near zero voltage.
During the on-effect, however, the response to the individual waves
of the stimulating tone appears. The frequency and wave form
of the stimulating tone are reproduced within 1 or 2 vibrations,
although they are superimposed on the on-effect. There is no
long-continued build-up of response, such as would be expected
if the ear were a sharply tuned, slightly damped resonating
system. It is also evident that any sudden disturbance, whether
an isolated click or the beginning of a continuous tone, causes
a disturbance whose form depends upon the characteristics of
the ear as well as those of the physical stimulus.

Fig. 131. Oscillogram showing the on-effect and off-effect in the cochlear
microphonic.
A. Sound waves at 2500 cycles recorded by condenser microphone. The
on-effect and off-effect of the loud speaker are just visible as a small
transient wave at the beginning and another at the end of the main
wave-train. The slight temporary reduction in amplitude near the beginning
of the wave-train is due to interference by an echo in the sound tube.
B. Cochlear microphonics from the round window of a cat in response to
the same sound waves at 2500 cycles. The on-effect and off-effect introduced
by the ear are very prominent. The response to the tone is nearly maximal.
The on-effect and off-effect are less prominent with less intense tones. Note
the similarity of the on-effect and off-effect to the response to a click shown
in Fig. 130 A. (Derbyshire and Davis, 2.)

Off-Effect. When a stimulating tone stops abruptly, electrical
activity does not immediately cease. Instead, it subsides
gradually, according to a pattern very much like that of the
on-effect. For a given intensity of stimulation, the off-effect is
usually less prominent than the on-effect, but both consist of
rather similar waves of about the same frequency which decline
with about the same rapidity. Figure 131 A shows that both
on-effect and off-effect are observed experimentally under
conditions in which records taken through a condenser microphone
apparently show no corresponding disturbance in the physical
stimulus. If a listener’s ear is placed in the position of the
condenser microphone, however, the listener reports that the
tone seems to begin and to end with a distinct click. The click
in this case is primarily an aural click, generated by the
mechanism of the ear.

The ‘click pattern,’ the on-effect, and the off-effect all
apparently express the same fundamental fact, namely, that the ear
is essentially a mechanism possessing inertia and elasticity. The
period of vibration disclosed by these transient patterns
corresponds fairly well, as we might expect, to the range of frequencies
to which the ear is most sensitive. In the experiments with
cat and guinea pig it will be recalled that these frequencies lie
between 750 and 2000 cycles. It has been pointed out in Chapter
10 that the transmission system of the ear is considerably,
but not completely, damped. This is true for the general over-all
vibration revealed by the ‘click response’ and the on-effect
and by the mechanical vibration of the ossicles. The cochlear
microphonics furnish no evidence of undamped resonant
structures, except for this natural period of the transmitting system
as a whole.
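The correspondence between the duration of the click waves and the range of greatest sensitivity can be checked by converting period to frequency, and the general character of a well-damped transient can be pictured as a sinusoid whose amplitude shrinks by a fixed fraction in each cycle. The sketch below is illustrative only: the decay of 0.4 per cycle and the natural frequency of 1500 cycles are assumed values chosen for the example, not measurements from the experiments described.

```python
import math


def frequency_from_period_msec(period_msec):
    """Frequency in cycles per second of a wave lasting period_msec."""
    return 1000.0 / period_msec


def damped_transient(natural_hz, decay_per_cycle, n_cycles, samples_per_cycle=20):
    """Samples (time in sec, value) of a sinusoid whose amplitude is
    multiplied by decay_per_cycle during every cycle (assumed form)."""
    samples = []
    for i in range(n_cycles * samples_per_cycle):
        cycles = i / samples_per_cycle
        t = cycles / natural_hz
        envelope = decay_per_cycle ** cycles
        samples.append((t, envelope * math.sin(2.0 * math.pi * cycles)))
    return samples


def peak_per_cycle(samples, samples_per_cycle=20):
    """Largest absolute value reached within each successive cycle."""
    peaks = []
    for start in range(0, len(samples), samples_per_cycle):
        chunk = samples[start:start + samples_per_cycle]
        peaks.append(max(abs(value) for _, value in chunk))
    return peaks


# Waves of 0.5 to 0.7 msec, as seen in the response to a click.
upper_hz = frequency_from_period_msec(0.5)
lower_hz = frequency_from_period_msec(0.7)

# Three waves of an assumed, heavily damped transient.
peaks = peak_per_cycle(damped_transient(1500.0, 0.4, 3))

print("frequencies implied by the click waves:", lower_hz, "to", upper_hz)
print("peak of each successive wave:", peaks)
```

Waves lasting 0.5 to 0.7 msec imply frequencies of about 1400 to 2000 cycles, which fall within the 750 to 2000 cycles quoted for cat and guinea pig, and with so heavy an assumed decay only three or four waves remain of appreciable size.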

It is not surprising that the cochlear microphonics for click,
on-effect, and off-effect should resemble one another. When
we analyze a brief auditory stimulus, we find that the sudden
beginning and the sudden termination of a tone are equivalent
to the appearance of acoustic energy over a wide range of the
spectrum (cf. p. 103). A nearly instantaneous change of
pressure, representing a click or a tone that is started or stopped,
is represented by a nearly continuous band of frequency
components. In a certain physical sense, then, a click is
actually generated by the sudden starting or stopping of a tone,
and it is not surprising that the corresponding cochlear
microphonics resemble one another. In interpreting the on-effect,
it is helpful to recall that a wide spectrum, similar to that
represented in Fig. 39 (p. 104), is present. At the onset the ear
starts to respond to all the components, including the principal
frequency. All except the principal tone are quickly damped
out, however, so that the on-effect is transient, but the response
to the principal frequency persists and is revealed by the
cochlear microphonic.
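The statement that an abruptly started and stopped tone carries energy far from its principal frequency can be illustrated numerically. The sketch below compares a 2500-cycle tone burst that is switched on and off instantly with one whose beginning and end are smoothed by a gradual ramp, and measures the strength of a single component well away from the tone. Everything except the 2500-cycle frequency is an assumption made for the example: the sampling rate, the duration of the burst, the raised-cosine ramp, and the probe frequency of 8125 cycles.

```python
import cmath
import math


def tone_burst(freq_hz, sample_rate, n_samples, ramp_samples=0):
    """A tone lasting n_samples; with ramp_samples > 0 the onset and the
    cessation are smoothed by raised-cosine ramps (an assumed shape)."""
    burst = []
    for n in range(n_samples):
        gain = 1.0
        if ramp_samples:
            from_edge = min(n, n_samples - 1 - n)  # distance to the nearer end
            if from_edge < ramp_samples:
                gain = 0.5 * (1.0 - math.cos(math.pi * from_edge / ramp_samples))
        burst.append(gain * math.sin(2.0 * math.pi * freq_hz * n / sample_rate))
    return burst


def magnitude_at(signal, freq_hz, sample_rate):
    """Magnitude of the Fourier component of the signal at freq_hz."""
    total = sum(x * cmath.exp(-2j * math.pi * freq_hz * n / sample_rate)
                for n, x in enumerate(signal))
    return abs(total)


def off_tone_fraction(signal, tone_hz, probe_hz, sample_rate):
    """Component at probe_hz relative to the component at the tone itself."""
    return (magnitude_at(signal, probe_hz, sample_rate)
            / magnitude_at(signal, tone_hz, sample_rate))


TONE_HZ = 2500.0       # frequency of the tone in Fig. 131
SAMPLE_RATE = 50000.0  # assumed
N_SAMPLES = 200        # assumed: a burst of 4 msec
PROBE_HZ = 8125.0      # assumed probe frequency, far from the tone

abrupt = tone_burst(TONE_HZ, SAMPLE_RATE, N_SAMPLES)
ramped = tone_burst(TONE_HZ, SAMPLE_RATE, N_SAMPLES, ramp_samples=50)

abrupt_fraction = off_tone_fraction(abrupt, TONE_HZ, PROBE_HZ, SAMPLE_RATE)
ramped_fraction = off_tone_fraction(ramped, TONE_HZ, PROBE_HZ, SAMPLE_RATE)

print("abrupt onset and cessation:", abrupt_fraction)
print("gradual onset and cessation:", ramped_fraction)
```

The abrupt burst shows a distinctly larger component at the probe frequency than the ramped burst, which is the sense in which a click is generated by the sudden starting or stopping of a tone. A single probe frequency is used only to keep the example short; a fuller treatment would examine the whole spectrum.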

The response of the ear to a complex acoustic spectrum is
determined in part by its own selective sensitivity. The ear is
most sensitive to 2000 cycles and responds preferentially to the
components of the click near its own peak of sensitivity.

COCHLEAR MICROPHONICS IN RELATION TO 
TRAVELING WAVES ON THE BASILAR MEMBRANE 

The aural microphonics recorded electrically from the round
window in response to a click present certain features which at
first glance seem at variance with the concept of a traveling
wave developed in Chapter 10. In the first place, the latency
of the aural microphonic with respect to the time of arrival of
the sound wave at the eardrum is not greater than 0.1 msec.
This brief latency seems to imply that there is no time for the
propagation of a traveling wave. Furthermore, the pattern of
the aural microphonic corresponds very closely to the pattern
of mechanical movement executed by the eardrum and the
stapes. The difficulties disappear, however, when we remember
that the aural microphonic which we record from the round
window is generated primarily by the movement of the portion
of the basilar membrane near the round window. According
to Békésy’s model, the movements of this portion should occur
practically simultaneously with the inward or outward movement
of the stapes and should accurately reflect the temporal
pattern of this movement.

Following upon the movement of the basilar membrane near
the round window, a wave of disturbance presumably travels
toward the apex of the cochlea and arrives at the helicotrema
after from 1.5 to 2.0 msec. Depending upon the sharpness of
the sound wave of the click, the traveling wave dies out more
or less rapidly as it progresses along the membrane (cf. Fig. 117,
p. 283). Hence, as viewed at the round window, the part of
the cochlear potential contributed by the wave when it
approaches the helicotrema becomes smaller, not only because the
disturbance is getting farther away from the electrode at the
round window but also because the wave itself is dying out.
Added to these two factors, there is another effect which
prevents clear registration of the microphonic of a click. The
latency of the response of the auditory nerve is such that the
first action potentials make their appearance just as the traveling
wave is approaching the helicotrema. The presence of the
action potentials makes it impracticable to derive from the form
of the total pattern a clear picture of the late phases of the
behavior of the basilar membrane in response to clicks (cf.
Fig. 130).
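The two factors that reduce the contribution of the apical region to the potential recorded at the round window can be pictured with a toy calculation. The sketch below is not a model taken from the text: it simply assumes that both the dying out of the traveling wave and the weakening of the electrical effect with distance from the electrode follow exponential laws, and the decay lengths are arbitrary illustrative numbers. Position is expressed as a fraction of the distance from the round window (0) to the helicotrema (1).

```python
import math


def round_window_contribution(position, wave_decay_length, electrode_decay_length):
    """Relative contribution to the recorded potential from a point on the
    basilar membrane, under two assumed exponential decays."""
    wave_amplitude = math.exp(-position / wave_decay_length)
    electrode_weight = math.exp(-position / electrode_decay_length)
    return wave_amplitude * electrode_weight


# Assumed decay lengths, in fractions of the length of the membrane.
DULL_CLICK_DECAY = 0.8    # wave of a dull click dies out slowly
SHARP_CLICK_DECAY = 0.3   # wave of a sharp click dies out quickly
ELECTRODE_DECAY = 0.5

for label, decay in (("dull", DULL_CLICK_DECAY), ("sharp", SHARP_CLICK_DECAY)):
    base = round_window_contribution(0.0, decay, ELECTRODE_DECAY)
    middle = round_window_contribution(0.5, decay, ELECTRODE_DECAY)
    apex = round_window_contribution(1.0, decay, ELECTRODE_DECAY)
    print(label, "click:", base, middle, apex)
```

Under these assumptions the contribution falls steadily from the basal end toward the helicotrema, and falls faster for the sharper click, so that the late phases of the response contribute little to the record even before the action potentials obscure them.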



CHAPTER 14 


CONSIDERATIONS AS TO THE NATURE 
AND ORIGIN OF AURAL MICROPHONICS 

The aural microphonics, the electric potentials generated in
the cochlea as a result of stimulation by sound, have proved of
great value in the study of the function of the inner ear, but
their exact nature and significance are still not entirely certain.
It is generally agreed that they are generated within the cochlea.
They are not generated by the tympanic membrane or by the
ossicles, since these structures may be damaged, or the continuity
of the ossicular chain may be interrupted at the incudostapedial
joint, and yet the electric potentials will still be generated in
response to sounds delivered to the cochlea by bone-conduction.
In such experiments the threshold for bone-conduction is
scarcely elevated (cf. p. 295).

The most generally accepted hypothesis ascribes the origin
of aural microphonics to the organ of Corti, specifically to the
hair-cells of that organ. This view is based primarily on the
correlation between the apparent state of these cells and the
threshold or the maximal amplitude of the microphonics.
Some investigators have failed, however, to confirm this
correlation and have advanced other hypotheses as to the origin of the
electric potentials. One of the alternative hypotheses ascribes
the potentials to the terminations of the fibers of the cochlear
nerve. Another emphasizes the mechanical vibration of polarized
membranes, particularly Reissner’s membrane. Yet another
ascribes the electrical effect to “streaming potentials”
generated by the movement of fluid in the channels of the inner
ear or through the pores of the various membranes. We may
say at once that the neural hypothesis has been abandoned by its
proponents as untenable in the light of recent evidence. The
microphonics differ from the action potentials of the nerve in
several fundamental respects, including differences in wave-form,
in latency, in polarity, in limits of frequency, in resistance
to cold, to lack of blood supply, and to fatigue, and in the
phenomenon of masking. The non-neural theories share one
basic feature in common: the energy of the electrical disturbance
is derived ultimately from the sound waves and not from
the metabolic activities of the tissues. The cochlea does not
release stored energy, as does a muscle fiber or nerve-cell, but
acts merely as an electromechanical transducer, converting
mechanical into electrical energy. The various non-neural theories
differ as to which structures and which physical principles are
responsible for this transformation.

A feature shared by all theories is that the structure responsible
for the electrical phenomena is differentiated with respect
to the upper and lower ends of the cochlea in such a way that
the electric potentials generated by low tones arise near the
apical end of the cochlea, and those generated by high tones
arise near the basal end. The experimental evidence which
demonstrates this fact will be presented in the following chapter
in connection with the localization of frequency reception on
the basilar membrane.

It is difficult to deny that electric potentials may be generated
by movement of fluid within the cochlear canals and through
the various membranes (provided such movement through the
membranes actually occurs). Also, if Reissner’s membrane and
the basilar membrane are electrically polarized, as may well be
the case, their vibration must cause some corresponding electrical
disturbance (Hallpike and Rawdon-Smith, 2, 3, 4). The
question is whether these effects account for all or even for a
significant part of the observed potentials. The hair-cell theory
asserts that they do not and that integrity of the hair-cells of
the organ of Corti is essential for the generation of the aural
microphonics.

EVIDENCE FOR AND AGAINST THE HAIR CELL 
THEORY 

1. Congenitally Deficient Animals. Nature occasionally
provides animals which lack the organ of Corti wholly or in
part. Examples of such animals are albinotic cats, with white
coats and blue eyes, and waltzing guinea pigs. Howe and
Guild examined three albinotic cats which were clinically deaf,
and they could detect no electric activity in the cochleas.
Histologically the ears showed practically complete absence of the
organ of Corti, including all its hair-cells, and also pathological
changes in the macula of the saccule and collapse of the cochlear
duct. Davis, Derbyshire, Lurie, and Saul reported a similar
case in which one ear had no organ of Corti and was inactive
electrically, whereas the other ear was normal in both respects.
They also found that the microphonic effect was absent in a
waltzing guinea pig whose organ of Corti was normal as far
as the tunnel and the supporting cells were concerned, but whose
hair-cells were abnormal, particularly in the basal turns. Another
such animal (Lurie, Davis, and Derbyshire) yielded small
microphonics, with high thresholds, to tones below 1800 cycles.
In this animal the degenerative changes in the hair-cells were
most marked in the three basal turns. Another animal
(Lurie, 1) showed degenerative changes in the external hair-cells
throughout the organ of Corti, while the internal hair-cells
were normal. This animal showed an elevation in the threshold
for the microphonics of 20 to 30 db throughout the auditory
range. Lurie and Davis (unpublished) studied two albinotic
dogs and eight waltzing guinea pigs in which the microphonics
and the hair-cells were both absent. In some of these animals
there were other abnormalities, such as collapse of Reissner’s
membrane or of the tectorial membrane. In some animals the
organ of Corti was completely absent, as illustrated in Fig. 132.
In others a rudimentary organ of Corti, composed of supporting
cells and occasional rods of Corti but without hair-cells, occupied
part of the basilar membrane. In none of the animals could
microphonics be detected, but in all the animals in which any
normal hair-cells were present corresponding cochlear potentials
were found.

Bast and Eyster, on the other hand, reported studies of
several animals which gave essentially normal electric potentials
although the organ of Corti was either entirely absent or highly
atrophic, except for part of the basal turn. Their measurements
are given in terms of microvolts of electric potential developed
in response to sounds of various frequencies, generated by
alternating currents of fixed voltage. These measurements are not
directly comparable either with determinations of threshold,
with equal-potential contours, or with curves expressing maximal
potential as a function of frequency. Nevertheless, the
“practically normal” curve which they obtained by this method
from the animal which had a normal organ of Corti in part of
the basal turn is comprehensible, for, with strong stimulation,
considerable electric potential may be generated by a small
portion of the cochlea. This is the general experience of many
investigators. In unresponsive animals, Bast and Eyster found
that Reissner’s membrane was usually collapsed and adherent
to the basilar membrane and organ of Corti. The hair-cells
are invariably more or less abnormal in such circumstances.
Also, the collapse of Reissner’s membrane almost certainly
modifies the mechanical vibration of the finer structures of the organ
of Corti beneath it, so that it is not surprising, according to either
theory, that the electrical activity should be diminished. None
of these considerations, however, explains the normal aural
microphonics reported for an animal with no organ of Corti
whatever.

Fig. 132. Section through the basal turn of the cochlea of a waltzing
guinea pig. This animal was clinically deaf and yielded no cochlear
microphonics. Note the absence of the organ of Corti and the presence of
basilar membrane, Reissner’s membrane, and tectorial membrane. The spiral
ganglion of Corti contains only a small fraction of the usual number of nerve
cells. Compare this figure with Figs. 113 and 135. (Lurie, 1, 2.)

2. Stimulation Deafness. Animals exposed to strong tonal
stimulation over long periods of time often show a definite
depression of sensitivity which is confined to a portion of the
auditory scale (cf. Chapter 15). Elevation of the threshold for
tones of medium pitch correlates with abnormality of the organ
of Corti, particularly of the external hair-cells, over a limited
range in the middle of the basilar membrane. No other
anatomical abnormality is to be detected in most animals thus
exposed and tested.

3. Injection of Drugs. Chemicals, such as sodium-chloride
crystals or cocaine, when placed on the round window
membrane, reduce the aural microphonics. In some experiments,
solutions of drugs have been injected through the round
window membrane, and a typical result is elevation of threshold
and depression of cochlear microphonics for high frequencies of
stimulation. Histological examination, as a rule, shows damage
to the sensory cells in the basal turns. We find, here again,
the double correlation of elevation of threshold associated with
abnormality of the sensory cells, with this elevation of threshold
confined to that part of the frequency scale which corresponds
to the position of the anatomical damage on the basilar
membrane. The deficiency of response to high tones is associated
with damage to the hair-cells of the organ of Corti in the basal
turns.

4. Surgical Damage to the Cochlea. Surgical lesions of varying
degrees of severity have been produced in cats and guinea
pigs by many investigators, and the effect on electrical activity
determined immediately or following a period of recovery and
repair. The results vary widely and various interpretations
have been placed upon them. Obviously, many factors,
mechanical, biological, and electrical, are involved. Some lesions
clearly interfere with the physical transmission of sound waves.
These lesions include extensive hemorrhage into or collapse
of the scala tympani or scala vestibuli in the basal turn, and also
extensive perforation of the basilar membrane in this region.
The latter lesion apparently acts as a ‘short circuit’ for the
sound waves. Other lesions may modify the conditions of electric
current flow in and around the cochlea. Either the cochlear
potential may be locally short-circuited, as by an extensive
lesion in the basilar membrane, or the flow of current to an
external electrode may be hindered, as by thickening and
scarring of the round window membrane. Depression of electrical
activity from either of these general causes must be discounted
in drawing conclusions as to the origin of the potentials.
Many experiments are inconclusive because of such
complications.

Acute surgical damage to the cochlea of the guinea pig
shows that extensive lesions, such as complete removal of the
two apical turns, or the opening of either the scala tympani or
the scala vestibuli in such a way as to allow the cochlear fluid
to ooze from the opening, do not necessarily abolish electrical
activity. The threshold may be altered somewhat and the
maximal response may be depressed, perhaps more for some
frequencies than for others, but a surprising degree of activity
often persists. A small section of normal organ of Corti may
generate, at appropriate frequencies, an electric potential quite
as great as an entire cochlea. It will also generate some potential
at all frequencies. These facts have an obvious bearing on
the question of the ‘tuning’ of various parts of the cochlea for
particular frequencies, for they show that such ‘tuning’ is
not sharp or circumscribed, and they make more difficult any
correlation with damage to one or another structure or region
within the cochlea. Nevertheless, it is possible to detect elevations
of the threshold for cochlear microphonics over certain
frequency ranges and to compare the limits of normal threshold
with the anatomical limits of the normal organ of Corti. The
consistent correspondence between definite frequencies and
position on the basilar membrane (see Chapter 15) found by
this method constitutes important evidence in support of the
hair-cell theory.

5. Degeneration of the Auditory Nerve. When the fibers
of the auditory nerve are cut at the internal auditory meatus,
the cells of the spiral ganglion and the peripheral portions of
the nerve fibers within the cochlea degenerate (Crowe). This
is an exception to the usual law of Wallerian degeneration,
according to which only the portions of the neurons which have
been separated from their cell bodies should degenerate.
Actually the ganglion cells, and ultimately the sensory hair-cells
as well, degenerate quite completely. The degeneration of the
hair-cells may depend upon partial interference with the blood
supply of the cochlea (Guttman and Barrera), which is difficult
to avoid since the internal auditory artery runs in close association
with the nerve (see Fig. 112, p. 270). If the artery is completely
severed, the entire organ of Corti, as well as the nerve,
degenerates rapidly, and following such complete degeneration
no cochlear microphonics or action potentials can be detected.
In most instances in which the nerve has degenerated, but in
which the organ of Corti still remains normal, the microphonics
persist. One such animal, however, yielded no microphonics
(Hallpike and Rawdon-Smith, 3). The auditory nerve had been
cut for twelve weeks, and degeneration, or some other type of
atrophic change, may have begun, but if so it escaped histological
detection. Hallpike and Rawdon-Smith originally interpreted
this case as evidence for a neural origin of the aural
microphonics but have subsequently (Ashcroft, Hallpike, and
Rawdon-Smith) abandoned this view in favor of a polarized
membrane theory. One possible anatomical lesion which causes
loss of the cochlear microphonics they believe to be blockage
of the cochlear aqueduct. This blockage is assumed to lead
to changes in the chemical relation between perilymph and
endolymph, with consequent loss of a postulated polarization of
Reissner’s membrane.

A subsequent series of eleven cats studied by these same
investigators provided the experimental basis for their present
theory. The auditory nerves had been cut at various intervals
of 2 days to 23 weeks, prior to the electrical test. Some aural
microphonics were found in all eleven animals. Subsequent
microscopic examination revealed nearly all possible relationships
between the degree of normality of the organ of Corti
and the maximal cochlear potential measured at four frequencies
(256, 512, 1024, 2048 cycles). Depressed microphonics were
found in the presence of practically normal organs of Corti and
normal microphonics with severely degenerated hair-cells.
Some of these cases conform to the hair-cell theory. Others
can be reasonably explained if we realize that a small region
of normal organ of Corti near the round window can yield a
maximal potential, as measured by an electrode on the round
window, which is nearly as great as that obtained from an entire
organ of Corti. All examples of this group had “normal”
organ of Corti in the basal coil, and the depression of electrical
activity was nearly proportional to the time elapsed since cutting
the nerve. Three animals more nearly resembled the original
case discussed above, in that they had normal organs of Corti
but greatly reduced microphonics. Two were tested 13 weeks
after operation, and it is significant that no animal showed
good responses after so long an interval. In the third animal,
tested after 4 weeks, the weakness of the cochlear microphonics
may be related to the “considerable hemorrhage” found in the
apical portion of the scala vestibuli. But, even if no explanation
for this case is to be found, it requires more than one such case
to render untenable the hair-cell theory, which accounts so
completely for the losses of cochlear microphonics which are
restricted to certain frequency limits. It is difficult to imagine
why a local loss of polarization of Reissner’s membrane should
occur. The very fact that there is any localization of the electrical
effects along the cochlea, as a function of frequency,
strongly suggests the basilar membrane or structures attached
to it as the origin of the electric potentials, and, if we admit
that the basilar membrane is nearly critically damped (cf. p.
286), we avoid the chief reason given by Hallpike, Hartridge,
and Rawdon-Smith for turning to Reissner’s membrane for the
source of the cochlear microphonics.





We have attempted in some detail to offer possible explanations
for this series of observations which superficially seem to
render the hair-cell theory inadequate. The chief ad hoc
assumption invoked is that denervation of the hair-cells or partial
interference with their blood supply may, after 8 or 10 weeks,
impair their electrical properties without necessarily causing
notable microscopic alterations.

More experimental work is necessary for a final evaluation
of the various theories, and much may be hoped from electrical
tests in which complete response-curves, as functions of
intensity at various frequencies, are obtained. These complete
functions should be far more illuminating than curves for constant
stimulus, for maximal response, or for threshold alone.
In the meantime it appears that the hair-cell theory has the
advantage of describing adequately a greater number of the
known facts than do the alternative suggestions.

THE HAIR CELL THEORY OF THE 
AURAL MICROPHONICS 

The hair-cell theory assumes that all, or nearly all, the
microphonic effects of the cochlea depend upon the integrity of the
hair-cells of the organ of Corti. These cells can no longer
generate microphonics when they have degenerated to such an
extent that, on subsequent microscopic examination of freshly
fixed, well-stained specimens, their nuclei are no longer normal
in appearance, or when the cells have undergone marked
cloudy swelling. Normally, however, the cells are believed
to be capable of generating an electric potential between their
bases and their free ends, that is to say, perpendicular to the
plane of the basilar membrane, as diagrammed in Fig. 133 A.
When the stapes is displaced in such a way that the basilar
membrane moves upward, Fig. 133 A shows how a systematic
polarization of the hair-cells would cause electrical positivity
of the round window and negativity of the oval window. Let
us consider in more detail the probable mechanical events in
the organ of Corti.

Fig. 133. Diagrams of sections through the cochlea.
A. Longitudinal section showing schematically how an outward movement
of the stapes generates a cochlear microphonic due to distortion of the
hair-cells. The cells are mounted 'in parallel' on the membrane, and the
potential difference between their ends can be detected by electrodes, as
shown. A difference of potential is also detectable between either of these
electrodes and another electrode elsewhere on the body of the animal.
B. Cross-section (drawn to scale from a photomicrograph) showing the
arrangement of the sensory cells in the organ of Corti. The one inner and
three outer hair-cells are placed on either side of the tunnel formed by the
rods of Corti, and their cilia are embedded in the tectorial membrane. When
the basilar membrane between the points B and C is displaced upward by an
outward movement of the stapes, a distorting pressure is exerted on the outer
hair-cells, as shown by the arrow. A strong stimulating sound wave will
move the entire organ of Corti about the pivot at point A and thereby compress
the internal hair-cell as well. The pressure of the stimulus needed to affect
the internal hair-cells must be about 50 times as great as that which is needed
to stimulate the external cells. The fibers of the auditory nerve are excited
during the upward movement of the basilar membrane.

Figure 133 B represents a section across the basilar
membrane, traced from an actual photomicrograph. When the
basilar membrane bulges upward or downward, its curvature is
probably much sharper across the membrane (Fig. 133 B) than
along the membrane (Fig. 133 A). Furthermore, it seems
probable that the triangular structure of the tunnel, formed by
the rods of Corti and the portion of the basilar membrane
beneath it (A-B in Fig. 133 B), is stiffer than the external portion
of the membrane, B-C. Point A is rigidly fixed, since it is at
the attachment of the basilar membrane to the bony lamina of
the modiolus. Point B will move slightly upward, as the stapes
moves outward, but, at least at low amplitudes of vibration, the
chief movement will be a bulging upward between B and C, as
indicated by the broken line in the figure. Such a bulging will
cause lateral or oblique compression of the external hair-cells.
Movement of the rods of Corti as a unit at higher amplitudes of
vibration will compress the internal hair-cell also. The driving
force must be about 50 times as great as that needed to
stimulate the external cells in order to stimulate the internal cell (cf.
p. 369). When the membrane bulges downward, all these
effects are reversed and the hair-cells are subjected to lateral or
oblique 'traction' instead of compression. It is also possible
that the tectorial membrane entangles the cilia of the hair-cells
and serves to make the cells more sensitive to very slight
amplitudes of vibration.

It will be shown in Chapter 16 that the phase of the
vibration of the basilar membrane during which the nerve impulses
are initiated is the upward swing associated with outward
movement of the stapes. According to this present schema, the
upward phase of basilar motion is the one associated with
compression of the hair-cells. The association of stimulation with
compression is in accord with the general behavior of tactile
sense-organs, and, furthermore, if we are to accept the theory of
excitation of the nerve fibers by a chemical substance (see
Chapter 16), it seems more reasonable to associate the liberation of
such a substance with compression of the hair-cell than with
'traction' upon it.

This outline of the mechanical events in the organ of Corti
is hypothetical and is suggested merely as a possible mechanism.
The mechanism as outlined fulfills one necessary condition,
however, in that it shows how the hair-cells may undergo one
and only one cycle of compression and traction during one cycle
of vibration of the basilar membrane. If we assume that the
hair-cells generate piezoelectric potentials between their upper
and lower ends when they are compressed, according to the
analogy of the piezoelectric crystal outlined in Chapter 12, we
have a possible explanation of the generation of the cochlear
microphonics.

It is extraordinary that in the ear the minute mechanical
vibrations and tiny pressures involved at threshold should
produce sufficient potential to be detected from outside the
cochlea. The actual movements may be of molecular
dimensions, since the threshold of detectable potential is near the
threshold for human hearing and the human ear can hear
sounds, at favorable frequencies, which involve movement of
the eardrum of less than the diameter of a hydrogen molecule
(see p. 56). The minuteness of these amplitudes is compatible
with the hair-cell theory, which ascribes the generation of the
potential to a semisolid structure, such as a cell containing
organized, oriented molecules. Furthermore, the external
detection of the potential is undoubtedly favored by the parallel
orientation of the generating cells, by the favorable electrical
circumstances offered by the conducting endolymphatic
channels, and by the insulating shell of bone around the cochlea.

Now, the hair-cell theory of the aural microphonics, which
ascribes the cochlear potential to a piezoelectric effect in the
end-organ, must, if our hypothesis is adequate, satisfy the law of the
conservation of energy. We have assumed that the hair-cells
behave as electromechanical transducers and convert into
electricity the acoustic energy delivered to them. Unlike the nerve
fibers, the hair-cells do not supply the energy for the electric
response; they merely convert energy from a mechanical into
an electrical form. This principle means, then, that no more
energy should appear in the form of electricity than is delivered
to the ear in the form of sound. The power output must not
exceed the power input. Although the two energies, acoustical
and electrical, cannot be computed exactly, we can show that
a reasonable equivalence exists. The argument may be made
as follows.

An 800-cycle tone at 60 db above the reference intensity
produces an electric potential of about $10^{-4}$ volt. The power in
this sound wave is $10^{-10}$ watt per square centimeter. Now,
this amount of power is delivered to the ear only provided
the impedance of the ear is equal to that of air and provided the
ear represents a cross-section of 1 sq cm. The impedance of
the eardrum at 800 cycles is essentially that of air (see Fig. 108,
p. 261), and the area of the drum in the cat is about 0.5 sq cm.
Hence, $5 \times 10^{-11}$ watt delivered to the eardrum produces a
measured potential of $10^{-4}$ volt. But since power is lost between the
eardrum and the cochlea, let us assume that only $10^{-11}$ watt
reaches the organ of Corti. If this amount of energy produces
$10^{-4}$ volt, we can compute immediately the resistance across
which this voltage is generated, because in an electric circuit
the power P in watts is given by

$$P = E^2/R$$

where E is voltage and R is resistance. Or

$$10^{-11} = (10^{-4})^2/R$$

The value of R, therefore, is 1000 ohms, and the question is:
does this represent a reasonable value? Direct measurements
have not been made, but from the resistances encountered in
cellular structures elsewhere in the body it is apparent that this
is very nearly what we should expect to be the electrical
resistance between the two sides of the basilar membrane. This
value is at least of the right order of magnitude to justify our
hypothesis that there is an equivalence between the acoustic
power delivered to the organ of Corti and the electric power
generated by the hair-cells.
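
The arithmetic of this argument may be retraced in a short Python sketch. The numerical values are those quoted in the text; the variable and function names are ours and purely illustrative, and the figure of $10^{-11}$ watt reaching the organ of Corti remains the assumption made above rather than a measured quantity.

```python
# Power-balance estimate for the cochlear microphonic (values from the text).
SOUND_POWER_W_PER_SQ_CM = 1e-10     # 800-cycle tone, 60 db above reference
CAT_EARDRUM_AREA_SQ_CM = 0.5        # approximate area of the drum in the cat
POWER_AT_ORGAN_OF_CORTI_W = 1e-11   # assumed value after middle-ear losses
MEASURED_POTENTIAL_V = 1e-4         # cochlear microphonic at this intensity


def implied_resistance_ohms(power_w, potential_v):
    """Solve P = E^2 / R for the resistance R."""
    return potential_v ** 2 / power_w


# Power intercepted by the eardrum when its impedance matches that of air.
power_at_eardrum_w = SOUND_POWER_W_PER_SQ_CM * CAT_EARDRUM_AREA_SQ_CM

# Resistance across which the measured potential would have to be generated.
resistance_ohms = implied_resistance_ohms(
    POWER_AT_ORGAN_OF_CORTI_W, MEASURED_POTENTIAL_V
)

print(f"power at eardrum: {power_at_eardrum_w:.1e} watt")
print(f"implied resistance: {resistance_ohms:.0f} ohms")
```

Had the whole of the power at the eardrum been assumed to reach the organ of Corti, the same relation would give a resistance of about 200 ohms, which is still of the same order of magnitude.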





though they were connected in parallel. But, from the point
of view of a pair of electrodes, one of which is on the round
window or apex and the other on a relatively distant part of
the head, the simple geometrical arrangement of the hair-cells is
complicated by several factors, including the spiraling of the
basilar membrane. Consequently, the hair-cells contribute in
no simple fashion to the potential across the recording electrodes.
The connection is neither series nor parallel, but a combination
of the two. Despite these complicating factors, we may expect
that cells near the apex will influence an electrode applied to
the apex more than do cells near the base. This expectation
depends upon the more favorable position of the apical cells for
contributing to a potential measured between the apex and a
remote electrode. The reverse must be true of cells near the
round window; they contribute most readily to the potential
seen at the round window.

At a given frequency, and for low intensities, the form of the
pattern of vibration of the basilar membrane presumably
remains constant as the sound intensity varies. More or less
potential will be generated by each activated cell, but the relative
contribution of each to the total will remain constant. Increase
in the total potential is then due to an increased contribution by
each cell. (This situation is quite different from that in a
structure composed of units, like nerve fibers, which follow an
all-or-none law. The number of active nerve fibers increases
with increasing strength of stimulation, but the potential
generated by each unit remains constant.)

Potential as a Function of Sound Intensity. How, then, are
we to interpret the linear portion of the curves in Figs. 125, 126,
and 127 relating the potential of the cochlear microphonic to
sound intensity? Where there is linearity and a slope of unity,
it is implied that over the corresponding range of intensities
(a) the transmission system delivering energy to the hair-cells
is behaving in linear fashion, (b) the relative distribution of
mechanical activity over the basilar membrane remains
constant, (c) the potential generated by the hair-cells is a linear
function of their mechanical distortion, and (d) no distorting
factors or nonlinear contributions of potential from other
sources, such as nerve fibers, have appeared. Experimentally,
this linear relation with a slope of unity appears to be the ideal
case, and other relations between cochlear microphonics and
sound intensity can be regarded as deviations from this ideal
relation.
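
What is meant by a slope of unity may be illustrated by a small Python sketch in which both the stimulus and the response are expressed in decibels. The two response functions are hypothetical: the linear one stands for the ideal case described above, and the hyperbolic-tangent one is merely a convenient stand-in for any system that approaches a ceiling. The gain, the ceiling, and the units are arbitrary assumptions.

```python
import math


def level_db(amplitude, reference=1.0):
    """Level of an amplitude (pressure or voltage) in decibels."""
    return 20.0 * math.log10(amplitude / reference)


def linear_response(pressure, gain=1e-3):
    """Ideal case: potential strictly proportional to sound pressure."""
    return gain * pressure


def saturating_response(pressure, gain=1e-3, ceiling=1e-3):
    """Hypothetical response that flattens toward a maximal value."""
    return ceiling * math.tanh(gain * pressure / ceiling)


def slope_db_per_db(response, pressure_low, pressure_high):
    """Slope of the response curve when both axes are in decibels."""
    rise = level_db(response(pressure_high)) - level_db(response(pressure_low))
    run = level_db(pressure_high) - level_db(pressure_low)
    return rise / run


linear_slope = slope_db_per_db(linear_response, 0.01, 0.1)
saturated_slope = slope_db_per_db(saturating_response, 1.0, 10.0)
print(linear_slope, saturated_slope)
```

In the linear case a 20-db increase of the stimulus raises the potential by 20 db; in the saturating case the same increase raises it by much less, so that the slope falls below unity.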

As sound intensity increases, the increase of potential
regularly becomes nonlinear when it reaches about 20 per cent of
its ultimate maximal value. The appearance of higher
harmonics indicates that this distortion is due to nonlinear
performance of the transmission system. Ultimately the
piezoelectric action of the hair-cells may also become nonlinear.

The mechanical displacement of any segment of the basilar
membrane itself is presumably linear at low intensities, but, like
all membranes, it ultimately reaches a limit beyond which
linearity fails. We do not know whether this limit is symmetrical
or unsymmetrical with respect to the position at rest. However,
as long as the response of each segment of the membrane is
linear, the relative distribution of energy along the membrane
will remain constant. As soon as any segment reaches its limit
of linearity, the pattern of distribution of activity must change
with further increase of intensity. Since the region of the
basilar membrane which is maximally disturbed will reach this
limit first, the pattern of the disturbance will change when this
limit is reached. The limit of linearity essentially represents a
point of diminishing returns beyond which the amplitude does
not keep pace with an increase in the displacing force.
Therefore, after the maximum of the disturbance on the membrane
has reached this limit, the disturbance on either side of the
maximum will grow more rapidly than the maximum itself.
Now, since the form of the over-all resonant properties of the
ear is imposed upon this process, it is plainly possible that the
relative increase on the two sides of the maximum may not be
equal, and the disturbance may become skewed. The skewing
would represent a shift of the maximum to a new position and a
consequent change in the apparent pitch of the tone. We have
in these notions a hypothetical explanation of the change of
pitch with intensity (see Chapter 3), as well as a possible
explanation of certain anomalies in the behavior of the cochlear
microphonics.

The Combination of Action Potentials with the Cochlear
Microphonics. The action potential appears as a secondary
wave in the electrical output of the cochlea. The action
potential is made up of many nerve impulses, more or less
synchronized, all of which exhibit approximately the same time-course
and the same potential. Each nerve fiber has a threshold which
probably corresponds to some finite degree of distortion of the
hair-cell to which it is attached, and each is all-or-none in its
response. The effect of each fiber on the recording electrodes
varies, of course, with the distance and orientation of the fiber.

The action potentials and the cochlear microphonic may
combine to give an electric potential either greater or less than
the microphonic alone. The variability encountered depends
upon the nonsinusoidal shape of the action potential and on
changes in the phase relation between microphonic and action
potential with change of the frequency of the stimulus (see
Fig. 124, p. 317).
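
The dependence of the combined potential upon phase can be sketched for the simplest case, in which the action potential is represented by its fundamental component alone. This is a deliberate simplification of ours, since the action potential is in fact nonsinusoidal; the amplitudes used below are arbitrary and the function name is illustrative.

```python
import math


def combined_amplitude(microphonic, action, phase_rad):
    """Amplitude of the sum of two sinusoids of the same frequency.

    `microphonic` and `action` are the amplitudes of the two components,
    and `phase_rad` is the phase difference between them.
    """
    return math.sqrt(
        microphonic ** 2
        + action ** 2
        + 2.0 * microphonic * action * math.cos(phase_rad)
    )


microphonic_only = 1.0
reinforced = combined_amplitude(1.0, 0.3, 0.0)       # components in phase
opposed = combined_amplitude(1.0, 0.3, math.pi)      # components in opposition
print(microphonic_only, reinforced, opposed)
```

With the components in phase the resultant exceeds the microphonic alone, and with the components in opposition it falls short of it, so that a change of phase with frequency suffices to produce either outcome.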

In the curve expressing the combined electrical output of
microphonic effect plus action potentials as a function of
intensity, we may find slopes either greater or smaller than unity.
At intensities slightly above threshold, we may expect a very
rapid growth of the action potential component, for such
increases have actually been observed in the eighth nerve at low
intensities. If the action potential reinforces the microphonic
effect, a slope greater than unity will result. If it interferes
with the microphonic effect, the result will be a slope of less
than unity, as in the section of curve C in Fig. 126 which lies
between 35 and 50 db (p. 320). On the other hand, at higher
intensities the action potential may cease to grow at a rapid
rate, and if the action potential is interfering with the
microphonic effect, the interference will then become less pronounced
at higher intensities. In that event, increase in intensity will
give a slope greater than unity, for the curve will be rising
toward the ideal curve of pure microphonic effect, which would
have been followed had it not been for the interference from the
action potential. This is probably the basis of the abrupt rise
in curve C of Fig. 126, between 55 and 65 db.

There are other possible causes of deviation from a linear
slope, and in general they tend to give slopes of less than unity.
Among these are: (a) Reflex contractions of the intra-aural
muscles. The tensor tympani and the stapedius contract in
response to stimulation by sound and reduce the transmission of
tones below 1000 cycles (p. 264). The threshold for this effect
is about 30 db above the threshold of hearing (Lorente de
Nó, 1). It is most improbable, therefore, that the reflex should
appear at very low intensities, or after the spontaneous activity
of the middle-ear muscles has been reduced by application of
chloroform. At frequencies above 1000 cycles, moreover, the
transmission of sound is not affected by this reflex. (b) A
constant error in the measurement of the size of the potential due
to background noise. This error is obviously important for
the very low amplitudes of response near threshold. (c) All
the deviations from linearity due to arrival at limits of linearity,
whether in the middle ear, the inner ear, or in the generation
of electric potential in the individual hair-cells. All these
factors impose a limit to the electric response and cause nonlinearity
of the response-curve for high intensities of stimulation.

TINNITUS 

If the nerve fibers of the auditory nerve are stimulated by
any means whatever, we should expect to experience an auditory
sensation. In fact, the most satisfactory explanation of most
cases of the persistent ringing in the ears called tinnitus is that
certain hair-cells, or the nerve fibers connected with them,
become hyperirritable and discharge nerve impulses more or less
continuously as a result of some pathological process. This
abnormal condition may be acute and temporary, as the result
of excessive stimulation by a loud sound, or it may be chronic.
The condition which underlies the almost universal
degeneration of hair-cells and ganglion cells near the oval window with
advancing age (Chapter 2) may well involve a temporary stage
of hyperirritability. This hypothesis would explain the usual
high pitch of mild chronic tinnitus, for, as we shall see in the
next chapter, it is the nerve fibers ending in the lower part of the
basal turn of the cochlea which evoke auditory sensations of
high pitch. This hypothesis is attractive, but it is not to be
taken as a universal explanation of all tinnitus, for some cases
are undoubtedly due to irritation of the higher auditory
pathways and centers within the brain.

ELECTRICAL STIMULATION OF THE COCHLEA 

Finally, as an aid to the understanding of the nature and
origin of cochlear microphonics, let us consider a reverse
phenomenon: the electrophonic effect. The ear behaves as an
electromechanical transducer when it converts sound waves into
the electrical waves which we call microphonics. Like most
other transducers, it is also capable of the reverse process, and,
when an alternating electric current is passed through the head,
we hear a tone whose pitch is determined by the frequency of
the current (see p. 65). Presumably, then, the ear converts
alternating currents into mechanical vibrations, as well as
mechanical vibrations into alternating currents.

The hair-cell theory of the origin of the cochlear
microphonics offers a reasonable explanation of the perception of a
definite pitch when a normal ear is stimulated electrically. We
have assumed that the hair-cells act as electromechanical
transducers, converting mechanical into electrical energy. In this
respect they are analogous to piezoelectric crystals. It is natural
to extend the analogy and to suppose that in the hair-cells, as
in piezoelectric crystals, the process is reversible and that, under
the influence of an electric field, the cells tend to alter their
shape. Therefore, as the electric field changes periodically, the
basilar membrane is made to vibrate. The portion of the basilar
membrane which, by virtue of the physical constants of the
auditory mechanism (mass, stiffness, and resistance; cf. Chapter 10),
is preferentially responsive to the particular frequency of the
stimulating current will vibrate with the greatest amplitude.
As a result of the mechanical vibration, nerve impulses will be
initiated in the appropriate fibers in the usual fashion.

It should be clear that this hypothesis does not involve
selective electrical tuning of the organ of Corti. The energy is
delivered to the ear electrically, but is transformed into
mechanical vibration. The 'tuning' itself is purely mechanical.

Of course, we cannot, as yet, be completely certain that the
mechanical vibrations are initiated by a piezo effect in the
hair-cells. It is possible that some other structure within the ear
possesses a residual charge, due to a process of polarization, so
that the structure is made to vibrate when immersed in an
alternating electric field. The action, in this case, would be of the
sort characteristic of a condenser microphone. We may be
certain, however, that, regardless of the structure responsible for
the vibrations, the fact that the perceived pitch is related to the
frequency of the alternating current is due to the mechanical
'tuning' of the cochlea.

If the electric current stimulated the auditory nerve directly,
we should not expect to discriminate the frequency of the
current. All frequencies should then sound essentially alike; they
should all sound like noises. Such, in fact, was the finding of
Andreef, Gersuni, and Volokhov when they stimulated people
whose cochleas were destroyed but in whom the auditory nerve
was still able to function. These people were unable to
distinguish between different frequencies when the auditory nerve
was stimulated by an electric current, and all frequencies
sounded like a noise.

The tones heard by the electrical stimulation of normal ears
lack the purity of tones heard in the usual way. When listening
to a sinusoidal electric current, one hears the higher harmonics
very prominently. In fact, some observers are able to identify
the pitch of a tone as an octave higher than the
stimulus-frequency. And when two currents of 1000 and 1700 cycles are
led simultaneously to the ear, a difference tone of 700 cycles
appears which sounds louder than either of the two primary
tones. Obviously the electrophonic phenomenon is subject to
considerable distortion.





The severity of the distortion introduced with electrical
stimulation can be demonstrated by the simple procedure of
connecting the electrodes on the observer to the output circuit
of a radio set. Music can be heard and popular tunes identified,
but the quality is definitely poor: 'tin-pan' music. Speech
can be easily recognized as speech, but only occasional words
can be understood (Stevens, 8). Clearly, electrical stimulation
does not promise much as an alternative means of hearing so
long as so much distortion is present. We should like, if
possible, to be able to account for this excessive amount of
distortion which is experienced under electrical stimulation.
Distortion may arise from two causes. First, since the moving
elements of the ear do not strictly obey Hooke's law, harmonics
may arise from stimulation by a sinusoidal mechanical force.
This fact is believed to account for the normal amount of
distortion introduced when the stimulus is a sound wave. It
appears unlikely, however, that this type of nonlinearity in the
auditory mechanism is able to produce all the distortion
observed under electrical stimulation. Hence, a second cause
must operate, one which appears to be electrical rectification.
Electrolytes bounded by various types of surfaces are known
to behave as complete or partial rectifiers, in that a current passes
the boundary more easily in one direction than in the other.
An electrolytic condenser is an example of this phenomenon.
Furthermore, whenever rectification of a sinusoidal current
takes place, the resultant current can be analyzed into a steady
component, the original frequency, and a series of harmonics.
Therefore, if partial rectification of the current sent through the
ear were to occur at some boundary, we should have reason to
expect the large distortion which was actually observed. On
the assumption that some of the distortion is due to electrical
rectification, it is to be expected that, when a high-frequency
modulated current is passed through the head, sufficient
demodulation would occur to allow the ear to hear the modulating
frequency. A frequency of 100 kilocycles was modulated by a
400-cycle wave and passed through the head of an observer, with
the result that the observer heard a 400-cycle tone (Stevens and
Hunt). This experiment demonstrates that the ear can respond
directly to a radio wave, provided the modulated
radio-frequency is conducted through the head at a sufficient intensity.
Tests show that some of the rectification accompanying the
electrophonic phenomenon occurs at the electrodes (one in the
external ear, the other elsewhere on the body) through which
the current is applied. Measurement of this part of the
rectification discloses, however, that it is too slight to account for
the observed effects, in stimulation either by audio-frequencies
or by radio-frequencies. There appears to be an additional and
large rectifying action taking place in the ear itself, which, of
course, occupies only a small part of the total conducting path
between the electrodes.
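
The two consequences of rectification invoked above, namely the appearance of a steady component and harmonics, and the demodulation of a modulated carrier, can be checked numerically. The Python sketch below is ours and rests on several assumptions: the rectifier is an idealized one which passes current freely in one direction and at half strength in the other, the depth of modulation is taken as 50 per cent, and the sampling rate and all names are chosen merely for convenience. It makes no claim about where or how rectification actually occurs in the ear.

```python
import math

SAMPLE_RATE_HZ = 1_000_000   # assumed sampling rate for the simulation
N_SAMPLES = 10_000           # 0.01 second of signal


def partial_rectifier(x, reverse_gain=0.5):
    """Idealized partial rectifier: attenuates current in one direction."""
    return x if x >= 0.0 else reverse_gain * x


def amplitude_at(samples, freq_hz):
    """Amplitude of the component of `samples` at `freq_hz` (0 = steady)."""
    n = len(samples)
    if freq_hz == 0:
        return abs(sum(samples)) / n
    w = 2.0 * math.pi * freq_hz / SAMPLE_RATE_HZ
    re = sum(s * math.cos(w * i) for i, s in enumerate(samples))
    im = sum(s * math.sin(w * i) for i, s in enumerate(samples))
    return 2.0 * math.hypot(re, im) / n


def tone(freq_hz):
    """Sinusoidal current of unit amplitude."""
    w = 2.0 * math.pi * freq_hz / SAMPLE_RATE_HZ
    return [math.sin(w * i) for i in range(N_SAMPLES)]


def modulated_carrier(carrier_hz, modulating_hz, depth=0.5):
    """Carrier whose amplitude is modulated at `modulating_hz`."""
    wc = 2.0 * math.pi * carrier_hz / SAMPLE_RATE_HZ
    wm = 2.0 * math.pi * modulating_hz / SAMPLE_RATE_HZ
    return [(1.0 + depth * math.cos(wm * i)) * math.cos(wc * i)
            for i in range(N_SAMPLES)]


# A rectified 1000-cycle sinusoid: steady part, fundamental, second harmonic.
rectified_tone = [partial_rectifier(x) for x in tone(1000)]
steady = amplitude_at(rectified_tone, 0)
fundamental = amplitude_at(rectified_tone, 1000)
second_harmonic = amplitude_at(rectified_tone, 2000)

# A 100-kilocycle carrier modulated at 400 cycles, before and after rectifying.
radio = modulated_carrier(100_000, 400)
before = amplitude_at(radio, 400)
after = amplitude_at([partial_rectifier(x) for x in radio], 400)

print(steady, fundamental, second_harmonic, before, after)
```

Before rectification the modulated current contains no energy at 400 cycles; after rectification a 400-cycle component appears, which a mechanically tuned cochlea could then treat like any other stimulus of that frequency.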



CHAPTER 15 


THE LOCALIZATION OF FREQUENCY 
RECEPTION ON THE BASILAR MEMBRANE 

In previous chapters we have seen why, from physical
considerations of the anatomical structure of the inner ear, we should
expect the basilar membrane to vibrate more vigorously near
the helicotrema in response to low tones and near the round
window in response to high tones, and we have considered
some experimental evidence from the study of the cochlear
microphonics which indicates this type of 'tuning' within the
ear. This notion of selective vibration is fundamental to an
understanding of frequency discrimination and sound analysis
by the ear, and we shall therefore consider more thoroughly the
evidence for localized disturbances within the cochlea. First,
from a study of the effects of loud sounds, we shall see that the
basilar membrane actually does vibrate in response to sound.
Then, we shall see what regions are preferentially activated by
particular frequencies and how such activation correlates with
the facts relating to pitch discrimination.

THE EFFECTS OF LOUD SOUNDS 

It is a matter of common knowledge that a very loud sound
produces sensations of discomfort, amounting to acute pain,
and leaves the ear temporarily deaf and with a persistent
ringing, known as tinnitus. Very violent explosive sounds may
even cause permanent deafness. In addition, the wave of
pressure set up in the labyrinth by a
very loud sound may be sufficient to stimulate the sensory cells
of the semicircular canals, the utricle, and the saccule. The
subjective sensation is then one of vertigo, or of a sudden
displacement in space: a jolt. The reflex response to such
stimulation is a sudden movement of the head, such as normally
tends to compensate for an actual sudden change of position
in space (Tullio, Békésy, 20). The direction and character of
the movement depend upon which of the labyrinthine
sense-organs are most strongly stimulated. This movement is not
to be confused with the orienting reflex in which the head is
turned toward the source of sound. The movements from
direct labyrinthine stimulation are parts of the complex pattern
of righting reflexes, evoked in this instance by an abnormal
mode of stimulation. For our present purposes their interest
is merely that they show the violence of the pressure waves
generated in the inner ear by very loud sounds.

Long-continued exposure to noise of considerable intensity
is reputed to be a cause of deafness. The term, boilermaker's
deafness, was coined to express this association of deafness with
an occupation involving such exposure. Adequate pathological
studies of human ears deafened either by acute accident or by
long-continued exposure to noise have not been made, and we
shall not attempt to describe the few cases that have been
examined. Animal experimentation has, however, provided
a fairly complete picture of the nature of detonation-deafness.
In the cochleas of guinea pigs exposed to the sound of
revolver shots at close range, Guild (1) recognized, at
post-mortem examination, degrees of damage ranging from a loss of
some of the external hair-cells, with or without derangement of
the supporting cells, to cases in which the external hair-cells
were all absent, the organ of Corti badly broken, and the inner
hair-cells absent as well. The distribution and the severity
of injury varied from animal to animal, but tended, on the
whole, to center at the middle of the cochlea. Occasionally
there were regions of serious injury adjacent to the stapes and
round window. The more severe injury near the center of the
membrane tended to shade off gradually, through lesser degrees
of injury on either side, to more or less normal organ of Corti
toward the ends of the basilar membrane.

In another type of extensive injury, produced by brief
exposure to a very loud tone (Stevens, Davis, and Lurie), part of
the organ of Corti external to the tunnel becomes completely
detached from the basilar membrane, although the membrane
itself and its arteries remain intact (see Fig. 134). As was
true for Guild's animals, the internal hair-cells, although not
detached from the basilar membrane, were usually severely
damaged.

Fig. 134. Section of the cochlea of a guinea pig which had been exposed
for 5 minutes to a tone of 400 cycles at 125 db above threshold. The
intra-aural muscles had previously been rendered inactive by the local application
of chloroform. Following exposure to the tone the threshold of the cochlear
microphonics was elevated by 50 to 70 db throughout the audible range.
The outer portion of the organ of Corti (1) has been dislodged and knocked
into the upper left-hand corner of the scala media. The basilar membrane (4)
is still intact and so are Reissner's (6) and the tectorial (2). The internal
hair-cell (5) is still in its normal position although the tunnel (3) has been
partially disrupted. (After Stevens, Davis, and Lurie.)

These lesions show, beyond a doubt, that the basilar
membrane does actually vibrate violently in response to strong
sounds. The basilar membrane itself is apparently capable of
withstanding violent mechanical agitation, but the organ of
Corti is more vulnerable. Furthermore, the external hair-cells
seem to be in a more exposed position than the internal cells.
It is a curious fact that the external hair-cells also seem to be
the more vulnerable to drugs, toxins, and the effects of
advancing age (Lurie, 1).





THE PLACE THEORY OF FREQUENCY RECEPTION 

We recognize in sounds an attribute which we term pitch.
Pitch correlates approximately with the frequency of the sound
waves falling upon the ear. The fact that we can differentiate
sounds with respect to this attribute (cf. Chapter 3) implies that
there is some difference in the pattern of nerve impulses passing
up the eighth nerve when the ear is stimulated by high as
opposed to low frequencies. There have been, in general, two
schools of thought as to the nature of this difference. One
school has assumed that different groups of nerve fibers are
activated by high and by low tones respectively, the other that
the fundamental difference in the pattern of neural events lies
in the frequency of nerve impulses, irrespective of the particular
fibers which may be involved. The first theory ascribes the
discrimination of frequency to the analysis of sounds by the
sense-organ itself; the second theory simply passes the problem
of analysis along to the central nervous system without
resolution.

One obvious principle by which physical systems can
discriminate between mechanical vibrations of different
frequencies is that of resonance. This principle is that, when
a series of structures of different natural periods of vibration are
exposed to a force of a particular frequency, the one whose
natural period corresponds most nearly to that frequency will
vibrate with the greatest amplitude. If there were in the inner
ear a set of resonators tuned to different frequencies, we should
have an adequate explanation, in physical terms, of sound
analysis, and hence of pitch discrimination. If the analysis is
carried out in the central nervous system, we have no
explanatory principle to offer, since there is nothing in the central
nervous system which has the appearance of a set of tuned
resonators, either mechanical or electrical. Helmholtz was
guided by this general principle when, following the lead of
earlier writers such as Bell and Cotugno, he elaborated his
famous theory of pitch discrimination. Helmholtz suggested
that the rods of the organ of Corti were the vibrating elements,
but he later ascribed the response to the fibers of the basilar
membrane. The systematic anatomical differences between
the upper and lower ends of the basilar membrane with respect
to its width and also to the size of the various structures of the
organ of Corti attached to it, described in Chapter 10, were cited
in support of this hypothesis.

Now, the evidence to be considered in this chapter demonstrates
that the auditory mechanism behaves, to some extent, as
if it were a resonant analyzer of the sort envisaged by Helmholtz.
A maximum of disturbance occurs on the basilar membrane
at different places for stimuli of different frequencies.
That much appears certain. That the maximum results from
the operation of the simple principles of resonance, however,
is extremely questionable. The damping of the cochlear system
is presumably too great for it to behave in a way analogous to a
row of resonators, such as we have in the strings of a piano.
Consequently, the simple resonance theory of frequency reception
must be regarded as an oversimplification, and we must
look to other principles for an explanation of the fact that a
particular tone activates a particular region of the basilar
membrane. In other words, the terms resonance theory and place
theory are not necessarily synonymous. In Appendix II is a
set of principles suggested to underlie the behavior of the
cochlea and to make possible a place theory, while forsaking the
principle of simple resonance.
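
The resonance principle, and the objection from damping, can be made concrete with a small numerical sketch. It is not part of the original argument. It assumes that each resonator is an ideal driven, damped linear oscillator and that all resonators share the same static deflection; the function names, the bank of natural frequencies, and the two damping ratios are illustrative choices, not measured properties of the cochlea.

```python
import math


def steady_state_amplitude(drive_hz, natural_hz, damping_ratio):
    """Relative steady-state amplitude of a driven, damped linear oscillator.

    Standard textbook result, normalized to the static deflection.
    """
    r = drive_hz / natural_hz
    return 1.0 / math.sqrt((1.0 - r * r) ** 2 + (2.0 * damping_ratio * r) ** 2)


def sharpness(drive_hz, bank_hz, damping_ratio):
    """Amplitude of the resonator tuned to the drive, divided by the
    largest amplitude found among the other resonators of the bank."""
    tuned = steady_state_amplitude(drive_hz, drive_hz, damping_ratio)
    others = [
        steady_state_amplitude(drive_hz, natural_hz, damping_ratio)
        for natural_hz in bank_hz
        if natural_hz != drive_hz
    ]
    return tuned / max(others)


# Hypothetical bank of natural frequencies (cycles per second).
BANK_HZ = [250.0, 500.0, 1000.0, 2000.0, 4000.0]

# Hypothetical damping ratios: one light, one heavy.
for damping_ratio in (0.01, 0.5):
    print(damping_ratio, round(sharpness(1000.0, BANK_HZ, damping_ratio), 2))
```

Under these assumptions the lightly damped bank singles out the resonator tuned to the driving tone, whereas in the heavily damped bank the tuned resonator no longer stands out from its neighbors, which is the sense in which heavy damping undermines a simple resonance theory.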

Our purpose in this chapter will be to examine the evidence
for the proposition that different areas of the basilar membrane
respond specifically to different frequencies, in accordance
with a place theory.

EVIDENCE FROM LONG EXPOSURE TO LOUD TONES 

Wittmaack, and others following him, exposed guinea pigs
to the sound of a bell, a whistle, or a pipe, either continuously
or for a certain period each day for many days, and then examined
the inner ears for pathological changes. Their guiding
motivation was to find a correlation between the frequency of
the tone used and the location on the basilar membrane of a
region of degeneration. It was tacitly assumed that selective
response of the basilar membrane would lead to destruction of
the organ of Corti only in that region of the basilar membrane
activated by the exposure-tone. Unfortunately, the reports of
the various investigators (cf. Kemp) are often at variance with
one another and do not lead to clear conclusions.

More recently, experiments of this type have been combined
either with tests of hearing of a functional sort, based upon
conditioned reflexes, or with tests involving the electrical activity
of the cochlea. A dog was conditioned to withdraw his leg
when any musical note was presented (Finch and Culler). His
normal threshold curve was determined by this method and he
was then exposed to intense tones for various periods. Eighteen
hours of exposure at 3000 cycles caused loss of hearing of
40 to 50 db in all parts of the auditory range between 200 and
5000 cycles. Subsequent exposure to a tone interrupted 52
times per minute caused still further loss of hearing.

Fig. 135. Degeneration of external hair-cells produced by long exposure to
a loud tone. The guinea pig was exposed continuously for 45 days to a tone
of 2500 cycles at 106 db above human threshold. The section is from the
middle of the second turn. The external hair-cells have degenerated completely,
but the internal hair-cell is still present and apparently normal. The
spiral ganglion of Corti appears to be normal.

1: Reissner's membrane
2: tectorial membrane
3: nucleus of normal internal hair-cell
4: nerve-cells of spiral ganglion
5: tunnel
6: supporting cells
7: degenerated external hair-cells, the nuclei have entirely disappeared
8: basilar membrane
9: stria vascularis
10: Hensen's cells

(Davis, Derbyshire, Kemp, Lurie, and Upton, unpublished.)

Intensities of stimulation approximately 100 db above threshold
are usually required to produce a lesion of the cochlea in
guinea pigs (Davis, Derbyshire, Kemp, Lurie, and Upton).
With such intensities at a frequency of 2500 cycles, degeneration
of the external hair-cells occurs, as shown in Fig. 135. The
degeneration regularly centers in the middle of the second
cochlear whorl, i.e., near the middle of the basilar membrane.
The sensitivity of a number of such animals was determined by
measuring the threshold of the aural microphonics. Moderate
impairment was found, but the loss centered at 1200 cycles
rather than at the exposure tone of 2500 cycles. The loss of
sensitivity corresponded reasonably with the severity and extent
of the histological lesion.

The greatest elevation of threshold for the cochlear microphonics
does not necessarily correspond to the frequency of the
stimulating tone which produced the damage. It is therefore
unsafe to assume that deafness due to exposure will be specific
for the frequency of the exposure tone. There appears to be
systematic displacement of the position of damage in the direction
of the greatest sensitivity of the ear, that is, toward the
middle of the auditory range, but this point requires further
experimental investigation. These facts, and also the extensive
damage caused in some experiments by brief exposure to very
loud tones, indicate that at high intensities of stimulation a
wide area of the basilar membrane is set into violent agitation.
The wide extent of the effect renders degeneration by excessive
stimulation useless as a method for locating the specific regions
of the basilar membrane which may be responsive to particular
frequencies. Nevertheless, many of these experiments did
show localized regions of degeneration, and, when a restricted
loss of sensitivity was detected by the electrical method, it
correlated in general with such a localized region of degeneration.


THE INJECTION OF DRUGS 

Experiments based upon the injection of drugs into the
cochlea through the round window or the placing of toxic
substances, such as sodium chloride or cocaine, on the round
window membrane (see p. 337) lead to a similar conclusion:
histological damage to the sensory cells of the organ of Corti
near the round window is associated with an elevation of the
threshold for cochlear microphonics at high frequencies of
stimulation.

EVIDENCE FROM HUMAN PATHOLOGY 

Two clinical forms of deafness encountered in man are
associated with more or less clearly defined degenerations of the
organ of Corti or of the auditory nerve fibers associated with
particular regions on the basilar membrane.

Gradual High Tone Deafness. The first of these, already
described in Chapter 2 (Fig. 22), is the progressive loss of hearing
for tones of high frequency which occurs with advancing
age. In this type of deafness, sensitivity falls off gradually
with increasing frequency of the test tone. This condition is
correlated with partial atrophy of the auditory nerve supplying
the basal turn of the cochlea (Crowe, Guild, and Polvogt).

Abrupt High Tone Deafness. In the other, less common,
form of high tone deafness the audiograms show much more
abrupt breaks, as illustrated in Fig. 20 (p. 61) and in Fig.
136, which may be correlated much more precisely with the
atrophy of auditory nerve fibers and the degeneration of the
organ of Corti found post mortem. Histological study (Crowe,
Guild, and Polvogt) of 79 ears of this type proved quite conclusively
that the receptors for high tones are located in the
basal turn of the cochlea. More specifically, a statistical analysis
of the data (Ciocco) shows that the lower end of the area receptive
to 2048 cycles is more than 9.5 mm and less than 12 mm
from the basal end. The upper boundary of the area for 4096
cycles is definitely more than 7.3 mm and less than 9.5 mm
from the basal end, and the region sensitive to 8192 cycles is
apparently located approximately 5 mm from the end. Figure
136 presents the audiogram and Fig. 137 a graphic summary of
the abnormalities of one of these ears.

Fig. 136. Audiogram showing abrupt high tone deafness. (After Crowe,
Guild, and Polvogt.)

There are exceptions in this series of observations, in which
the abrupt type of loss is not associated with a sufficient
degree of nerve atrophy to explain the impairment of hearing
which was measured ante mortem. In approximately one fourth
of the ears no lesion was found in either the middle or inner ear
that adequately explained the hearing loss. These ears have
less impairment of hearing than the group as a whole, and it is
possible that their losses were due to early organic changes in
the cochlea which the histological technique was not adequate
to demonstrate. On the other hand, the impaired hearing in
these ears may have been due to lesions of the central auditory
pathways. These studies show, incidentally, that nerve atrophy
does not necessarily precede atrophy of the organ of Corti.
There is apparently a nutritional relationship between the external
sulcus cells and the sensory cells, since atrophy of the
former usually precedes that of the latter. No etiology for
high tone deafness can be suggested here, but the observations
are of great theoretical importance for the understanding of the
mechanism of hearing since they are apparently the only ones
in which a correlation has been made between anatomical
changes in the sense-organ of an individual and quantitative
studies of his subjective hearing.

EVIDENCE FROM THE COCHLEAR MICROPHONICS 

As we have already seen (Chapter 13), measurements of
the cochlear microphonics show that the cochlea is differentially
‘tuned,’ i.e., one end responds preferentially to low tones and
the other to high tones. The only reasonable interpretation is
that the electromechanical activity associated with the low tones
is located in a position more favorable for detection by the electrode
at the apex, and that the activity in response to high tones
is located nearer to the round window. The relation is the same
in the guinea-pig and in the cat. It is illustrated in Fig. 123
(p. 314) by the crossing of the threshold curves for the microphonics
when the ‘active’ electrode is moved from the apex
to the round window.

Fig. 137. Chart of the pathological changes in the basal turn of the human
ear whose audiogram is shown in Fig. 136. The transition from normal to
abnormal is unusually abrupt in this case.

Inner spiral, nerve-fibers: black = normal, white = degenerated.
Second spiral, organ of Corti: rectangle with large dots = organ of Corti
with normal hair-cells, plain line = organ of Corti degenerated.
Third and fourth spirals: black = normal and white = abnormal external
sulcus cells and stria vascularis respectively.

Outside the spirals are indicated the limits of the zones required for the
reception of frequencies 8192, 4096, and 2048. The most probable locations
for 8192 and 4096 are indicated by the shaded rectangles. The locations are
based on an analysis of the abnormalities of 79 human ears and their audiograms.
(After Crowe, Guild, and Polvogt.)





A much more precise correlation of location on the basilar
membrane with frequency has been made by drilling into the
cochlea and producing local mechanical disruption of the organ
of Corti (Stevens, Davis, and Lurie). The thresholds of the
aural microphonics to various frequencies of stimulation were
determined before the operation and were again determined
immediately afterwards. In some cases a general loss of sensitivity
ensued, but usually the threshold of the microphonics
remained, within the limits of observational error, unaltered
at most frequencies. The threshold for certain frequencies,
however, was elevated, and, as a rule, the transition between
normal sensitivity and reduced sensitivity was sufficiently abrupt
to be located within a half octave on the frequency scale. The
cochleas were subsequently examined histologically, and the
transition between normal and abnormal organ of Corti was
found to be fairly sharp. In most instances the transition
could be located to within about 1 mm.

Fig. 138. Audiograms following local surgical damage to the cochlea.
No. 59 shows a loss to low tones. No. 70 shows a loss in the middle range
without loss at either extreme. (From Stevens, Davis, and Lurie.)
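
The comparison of thresholds before and after the operation can be sketched in a few lines. This is only an illustration of the bookkeeping, not the authors' procedure: the function name `transition_bands`, the allowance `error_db` for observational error, and all of the threshold values are hypothetical.

```python
def transition_bands(freqs_hz, before_db, after_db, error_db=5.0):
    """Return (lower_hz, upper_hz) pairs of adjacent test frequencies
    between which sensitivity changes from normal to reduced or back.

    A threshold counts as elevated when it has risen by more than
    error_db, a hypothetical allowance for observational error.
    """
    elevated = [
        (after - before) > error_db
        for before, after in zip(before_db, after_db)
    ]
    return [
        (freqs_hz[i], freqs_hz[i + 1])
        for i in range(len(freqs_hz) - 1)
        if elevated[i] != elevated[i + 1]
    ]


# Hypothetical thresholds in db; these are not data from the experiments.
freqs_hz = [250, 500, 1000, 2000, 4000, 8000]
before_db = [40, 30, 25, 25, 30, 45]
after_db = [41, 31, 45, 47, 31, 44]

print(transition_bands(freqs_hz, before_db, after_db))
```

Each pair returned brackets one border of a loss on the frequency scale; in the experiments it is such a border that was matched against the histological border of the lesion.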

Illustrative audiograms, showing the differences between
the threshold curves taken before and after the operative damage,
are shown in Fig. 138. In general, complete destruction
of part of the organ of Corti does not elevate the threshold for
any tone by more than 30 db. This is true even when more
than half the organ of Corti is destroyed, and suggests that
there is a wide spread of the disturbance on the basilar membrane.
The spread of vibration is apparently greater for low
than for high tones. In the audiograms of Fig. 138, the points
at which the response departs abruptly from normal correlate
with the boundaries between functional and nonfunctional
hair-cells. Thus, number 59 in Fig. 138 shows a sharp drop
at 1750 cycles and depression of all tones below that frequency.
Number 70 shows a departure from normal at about 600 cycles
and also at about 2250 cycles. Such departures were correlated
with the location of histological damage to the organ of Corti.

Figure 139 summarizes the results of this series of experiments
on guinea pigs. Position on the basilar membrane is
represented along the ordinate and frequency along the abscissa.
Each rectangle correlates the border of a lesion with an abnormality
in an audiogram. The width of the rectangle indicates
the range of frequency within which the deviation from normal
sensitivity occurred, and its height represents the width of the
zone on the basilar membrane separating definitely normal
from definitely abnormal hair-cells. The circles correlate the
centers of isolated depressions in the audiograms with the
corresponding circumscribed zones of damage to the hair-cells.
The band determined in the chart by the rectangles and circles
therefore indicates the positions on the basilar membrane at
which tones of various frequencies are received when they are
near the threshold of audibility.

Relation to Human Frequency Discrimination. The solid
curve in Fig. 139 was not drawn to represent the curve best
fitting the rectangles and circles. The solid curve was obtained
independently from an integration of DL's for frequency, as
measured in human ears (see p. 94). The data given by
Shower and Biddulph were integrated at the sensation level of
40 db and plotted on a scale (right-hand ordinate scale of Fig.
139) which was adjusted to make its total length correspond to
the length of the basilar membrane. Integration of DL's at
other sensation levels or loudness levels would give a different
value for the total number of DL's in the audible range, but
the form of the curve relating them to frequency would be
essentially similar to that in Fig. 139 (cf. Fig. 35, p. 96). In
making this adjustment of scales we assume that the minimal
detectable difference in frequency corresponds to the minimal
detectable distance between two adjacent regions of excitation
on the basilar membrane, and that this distance is constant
throughout the length of the cochlea. This is a reasonable
assumption, since the hair-cells are distributed rather evenly
along the membrane. The striking correspondence between
the results of experimental destruction of parts of the cochleae
of guinea pigs and the evidence derived from psychological
determinations of the capacity for the discrimination of pitch
testifies to the validity of both methods.

Fig. 139. The correlation between the position of damage along the basilar
membrane and the associated changes in the audiograms. The width of each
rectangle represents the frequency range within which the deviation from
normal sensitivity occurs, and its height represents the zone on the basilar
membrane separating definitely normal from definitely abnormal hair-cells.
The centers of the circles indicate the centers of peaks or depressions in the
audiograms and the centers of isolated normal regions or zones of damage of
the organ of Corti. Numbers refer to the audiograms shown in Fig. 138.
The solid line represents the integration of Shower and Biddulph's data for
human pitch-discrimination as explained in the text. (From Stevens, Davis,
and Lurie.)
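
The integration of DL's and the adjustment of scales can be sketched numerically. The sketch below is an assumption-laden illustration, not a reproduction of the published curve: the tabulated DL values are placeholders and are not Shower and Biddulph's data, the 30 mm membrane length is an assumed round figure, and the trapezoidal rule is simply one convenient way of integrating.

```python
def cumulative_dl_count(freqs_hz, dl_hz):
    """Number of DL's lying between the lowest frequency and each
    tabulated frequency, by trapezoidal integration of 1/DL."""
    counts = [0.0]
    for i in range(1, len(freqs_hz)):
        width = freqs_hz[i] - freqs_hz[i - 1]
        mean_density = 0.5 * (1.0 / dl_hz[i] + 1.0 / dl_hz[i - 1])
        counts.append(counts[-1] + width * mean_density)
    return counts


def position_map_mm(freqs_hz, dl_hz, membrane_length_mm):
    """Rescale the cumulative count so that its total spans the membrane,
    assuming every DL occupies the same distance along it."""
    counts = cumulative_dl_count(freqs_hz, dl_hz)
    scale = membrane_length_mm / counts[-1]
    return [count * scale for count in counts]


# Placeholder table (cycles); NOT measured values.
freqs_hz = [62, 125, 250, 500, 1000, 2000, 4000, 8000, 16000]
dl_hz = [3, 3, 3, 3, 3, 6, 12, 24, 48]

# Assumed membrane length in mm; distance is reckoned from the low-tone end.
positions_mm = position_map_mm(freqs_hz, dl_hz, membrane_length_mm=30.0)
for frequency, position in zip(freqs_hz, positions_mm):
    print(frequency, round(position, 1))
```

Because the rescaling multiplies every count by the same factor, a different total number of DL's changes the scale but not the form of the curve, which is the point made in the text about integration at other sensation levels.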

The integration shown by the curve in Fig. 139 indicates
that the human ear can distinguish, at the sensation level of 40
db, about 1300 tones between the lowest and the highest audible
frequencies. Since integration at higher levels yields a greater
number (p. 94), we may take the number 1500 as a more
representative figure. This value means, in terms of our earlier
assumption, that two tones can be differentiated in perception
provided they stimulate patches on the basilar membrane differing
in position by 0.02 mm. This compares with an approximate
sensibility of 1 mm on the tip of the tongue and 2.3 mm
on the finger tips. There are approximately 2500 internal
hair-cells in the human cochlea, and it is interesting to note that 0.02
mm on the basilar membrane is almost the distance occupied
by two internal hair-cells. The significance of this observation
lies in the fact that a single nerve fiber connects with one or
two hair-cells (cf. Chapter 10). This type of innervation suggests
that, in order to account for pitch discrimination, the number
of hair-cells should exceed the number of discriminable
differences. Hence, the relation of approximately two internal
hair-cells to one DL appears reasonable.
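
The arithmetic behind these figures is short enough to check directly. The sketch uses only the three numbers quoted in the paragraph; the variable names are illustrative, and the membrane length it prints is merely what those numbers imply, not a value stated in this passage.

```python
# Figures quoted in the text.
distinguishable_tones = 1500   # representative number of DL's
spacing_mm = 0.02              # separation on the membrane per DL
internal_hair_cells = 2500     # approximate count in the human cochlea

# Length of membrane implied by equal spacing of the DL's.
implied_length_mm = distinguishable_tones * spacing_mm

# Average number of internal hair-cells available per DL.
hair_cells_per_dl = internal_hair_cells / distinguishable_tones

print(f"implied length: {implied_length_mm:.0f} mm")
print(f"internal hair-cells per DL: {hair_cells_per_dl:.2f}")
```

The ratio comes out somewhat under two, in keeping with the statement that 0.02 mm is almost the distance occupied by two internal hair-cells.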

The internal hair-cells, since they are rather simply innervated,
should be responsible for pitch discrimination at its
best. On the other hand, the location of the internal cells at
the edge of the basilar membrane, in a region relatively protected
from mechanical agitation, suggests that their threshold
should be higher than that of the external cells. Direct evidence
for this notion was obtained from one animal, a cat,
which showed 30 to 40-db hearing loss for all tones and which
upon examination revealed degeneration of the external cells
only (Stevens, Davis, and Lurie). The fact that the threshold
for internal cells appears to be from 30 to 40 db above that for
the most sensitive external cells suggests an explanation for
the finding of Shower and Biddulph (cf. Fig. 31, p. 88) that
differential sensitivity to frequency is much less near threshold
than it is for tones 40 db above threshold. Indeed, an integration
of the DL's obtained at 5 db above threshold yields about
500 discriminable tones as against 1300 at 40 db. In other
words, the multiple innervation of the external hair-cells (several
hair-cells innervated by one fiber and several fibers connecting
with each hair-cell) appears to be reflected in poorer
differentiation of tones when the tones are so weak that the external
cells alone are activated.

Finally, we have in Fig. 140 a representation of the position
along the basilar membrane of maximal sensitivity to various
frequencies. The locations given in this figure for the reception
of high tones by the human ear are entirely consistent with
the positions ascribed to them by Crowe and his associates
(p. 363). The lower octaves are greatly crowded at the apical
end of the cochlea. This crowding explains the difficulty encountered
by most efforts to prove localized response of the
basilar membrane to low tones, and it also explains why the
differential sensitivity of the ear is relatively poor at low
frequencies.

Fig. 140. The localization of frequency reception on the basilar membranes
of man and guinea pig. (From Stevens, Davis, and Lurie.)

Further Evidence from Animal Experimentation. The
method employed in the experiments described above suffers
from one obvious criticism. The cochlea has been damaged by
entering it. It is always possible that the anatomical derangement
has altered the mechanical characteristics of the system
and thereby altered the position of maximal sensitivity on the
basilar membrane to a particular tone. Culler has reported a
series of experiments on the guinea pig free from this defect.
His procedure was to record the microphonics of the cochlea
from twenty-five different points on its external surface and to
determine the intensity of stimulation necessary to obtain an
arbitrary but very small response at various frequencies. For
each point a particular frequency could be found which gave
the arbitrary threshold response at a lower intensity of stimulation
than was necessary for any neighboring point. This finding
indicates that the microphonic for that particular frequency
of stimulation is generated near the point in question. The results
of Culler's experiment are plotted in Fig. 141. In this
representation, the cochlea appears coiled as though viewed
from a position along the axis of the spiral. The various maps
of the cochlea agree quite well with one another, although the
relative crowding of the lower octaves into a small space toward
the apex of the cochlea is less extreme in Culler's map. A possible
factor in Culler's method is that the lines of current-flow
may be deformed by variation in the thickness of the bony wall
of the cochlea and the locations of certain points systematically
displaced for this reason. All our present evidence taken together
would indicate that Culler's points are all slightly displaced
in the direction of the round window. Nevertheless,
we may conclude with certainty that a weak tone of given frequency
activates selectively one particular region of the basilar
membrane whose location is expressed by a map having the
general features of the maps in Fig. 140.

Fig. 141. The optimal positions in the guinea pig for detection of the
cochlear microphonics near threshold at the frequencies shown. The cochlea
is represented as a spiral viewed from a point on its axis. The helicotrema is
at the center. (After Culler.)
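
The logic of assigning a frequency to the recording point that needs the least stimulation can be sketched as a simple table lookup. Everything in the sketch is hypothetical: the function name `optimal_points`, the three labeled points (the experiment used twenty-five), and the intensities are invented for illustration and are not Culler's measurements.

```python
def optimal_points(thresholds_db):
    """Map each frequency to the recording point at which the criterion
    response is obtained with the lowest intensity of stimulation."""
    frequencies = sorted(next(iter(thresholds_db.values())))
    return {
        frequency: min(
            thresholds_db,
            key=lambda point: thresholds_db[point][frequency],
        )
        for frequency in frequencies
    }


# Hypothetical intensities (db) required for the criterion response.
thresholds_db = {
    "near round window": {500: 40, 2000: 30, 8000: 15},
    "middle turn": {500: 35, 2000: 18, 8000: 28},
    "near apex": {500: 20, 2000: 33, 8000: 45},
}

for frequency, point in optimal_points(thresholds_db).items():
    print(frequency, point)
```

A systematic distortion of the recorded potentials, such as the deformation of current-flow by the bony wall mentioned in the text, would shift every minimum in such a table and so displace the whole map.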

Other studies by the electrical method following the production
of local surgical lesions give confirmatory evidence in the
form of a gross localization. High tones are localized toward
the round window and low tones toward the apex, but in most
of these experiments the localization of particular frequencies
was not precise. The lack of precision is reasonably explained
by the fact that these other studies employed intensities of
stimulation far above threshold. Apparently, localization of
activity is sharply confined only at threshold, and the area of
vibration spreads extensively up and down the basilar membrane
as the intensity is increased.

THEORIES OF FREQUENCY DISCRIMINATION 

It is unreasonable, of course, to suppose that a single
hair-cell is ever stimulated separately. There is always some spread
of excitation along the basilar membrane, and, at high intensities,
the spread may cover the entire length of the cochlea.
How, then, are we to account for the remarkable ability of the
ear to resolve small differences of frequency? Certainly there
must be some principle by which one broad pattern of stimulation
can be distinguished from another very similar one, which
is displaced only 0.02 mm from the first.

A possible principle is that of maximal stimulation (Wilkinson
and Gray). Presumably, every pattern of basilar activity
has at least one maximum where excitation is greater than at
points on either side. The position of this maximum may
determine the pitch of a tone. Where several maxima occur,
due to a complex sound, the ear is able to distinguish the several
components of the sound. This principle has been tacitly assumed
in most efforts to explain auditory phenomena.
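
The principle of maximal stimulation amounts to counting the maxima of an excitation pattern, and a short sketch shows when two components yield two maxima. The bell-shaped (Gaussian) pattern, its width, and the positions of the components are assumptions made for illustration; the text does not specify the form of the pattern.

```python
import math


def excitation(position_mm, center_mm, width_mm):
    """Hypothetical bell-shaped excitation pattern along the membrane."""
    return math.exp(-0.5 * ((position_mm - center_mm) / width_mm) ** 2)


def summed_pattern(centers_mm, width_mm, positions_mm):
    """Total excitation at each position from components of equal strength."""
    return [
        sum(excitation(x, center, width_mm) for center in centers_mm)
        for x in positions_mm
    ]


def local_maxima(values):
    """Indices whose value exceeds the left neighbor and is not exceeded
    by the right neighbor."""
    return [
        i
        for i in range(1, len(values) - 1)
        if values[i - 1] < values[i] >= values[i + 1]
    ]


positions_mm = [i / 10 for i in range(301)]  # 0 to 30 mm in 0.1 mm steps

well_separated = summed_pattern([10.0, 20.0], 1.0, positions_mm)
overlapping = summed_pattern([14.5, 15.5], 1.0, positions_mm)

print(len(local_maxima(well_separated)), len(local_maxima(overlapping)))
```

With patterns of this assumed form, two components lying closer together than about twice the width of each pattern merge into a single maximum, so that the position of the maximum alone could not separate them.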

Objections have been raised against the principle of maximal
stimulation on the grounds that the maxima are not sharp, that
sometimes two tones can be heard when there is reason to
believe that only one maximum exists, and that, in low tones,
pitch may be perceived even though it is probable that the
pattern of excitation has no true maximum at all. In an effort
to circumvent the first two of these objections Bekesy (4)
pointed to an experiment by Mach in which it was shown that,
when a visual field contains two levels of brightness with a
gradual transition from one level to the other occurring in the
region separating the two levels, a bright and a dark ring can be
seen where we should expect to see only the beginning of a
smooth change in brightness. This effect is illustrated schematically
in Fig. 142. The stimulus changes linearly from level 1
to level 2, but the observer perceives a dark ring at A and a
bright ring at B. Bekesy also pointed out that, when a stimulus
of the form shown in Fig. 143 is pressed against the skin of the
arm, it produces the impression of an object having two ridges,
as shown in the same figure. These facts suggest that when a
sensitive surface of the body, either in the eye or on the skin,
is subjected to a stimulus in which there is a change of gradient
from place to place, the change of gradient stands out prominently
in sensation. If we regard the basilar membrane as
such a sensitive surface of the body, we might expect to find a
similar effect, an effect which would make the relatively sharp
change of gradient in excitation at the maximum of a basilar
disturbance stand out in sharp contrast to the rest of
the disturbance.

Fig. 142. Showing how an abrupt change of gradient in a visual stimulus
produces a salience in sensation. A stimulus whose intensity varies from level
1 to level 2 produces the appearance of a dark ring at A and a bright ring at B.

Fig. 143. Showing how a change of gradient affects the sensation produced
by a tactual stimulus.

The application of this principle of gradients to the
case of stimulation by two tones which are near together
in frequency and which stimulate overlapping regions on the
basilar membrane is illustrated in Fig. 144. The two excitations
sum to give a rather flat-topped pattern of stimulation in which
there are two sharp changes of gradient. At each of these places
we should expect the sort of salience depicted by the dotted
curves in the figure. As an instance in which this principle offers
interesting possibilities for explanation, we may recall the
experiment by Youtz and Stevens, in which they found that an
observer can identify the components of a frequency-modulated
tone when the components are spaced apart by only 8 cycles
(see p. 241).
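
The principle of gradients can be illustrated with a stimulus of the kind used in Mach's experiment. The sketch is a modern simplification and not Bekesy's own formulation: it assumes that sensation equals the stimulus minus a weighted discrete second difference, that level 1 is the dimmer level, and the function names, the weight, and the stimulus dimensions are all illustrative.

```python
def ramp_stimulus(n, foot, top, level_1, level_2):
    """Constant at level_1 up to index foot, linear rise to level_2 at
    index top, constant thereafter."""
    values = []
    for i in range(n):
        if i <= foot:
            values.append(level_1)
        elif i >= top:
            values.append(level_2)
        else:
            fraction = (i - foot) / (top - foot)
            values.append(level_1 + (level_2 - level_1) * fraction)
    return values


def change_of_gradient(values):
    """Discrete second difference: essentially zero wherever the gradient
    is constant, and non-zero only where the gradient changes."""
    inner = [
        values[i - 1] - 2.0 * values[i] + values[i + 1]
        for i in range(1, len(values) - 1)
    ]
    return [0.0] + inner + [0.0]


def sensation(values, weight=2.0):
    """Assumed model: stimulus minus a weighted change of gradient."""
    return [
        v - weight * c for v, c in zip(values, change_of_gradient(values))
    ]


stimulus = ramp_stimulus(n=30, foot=10, top=20, level_1=1.0, level_2=2.0)
salience = [s - v for s, v in zip(sensation(stimulus), stimulus)]

darkest = salience.index(min(salience))    # dip below the stimulus
brightest = salience.index(max(salience))  # overshoot above the stimulus
print(darkest, brightest)
```

In this model the only departures of sensation from the stimulus fall at the two ends of the ramp: a dip where the rise begins and an overshoot where it ends, corresponding to the dark and bright rings of Fig. 142.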

It has been urged that the principle of maximal stimulation
cannot apply to the perception of the pitch of low tones,
because a frequency near the lower limit of hearing is supposed
to stimulate the basilar membrane as vigorously immediately
at the helicotrema as at a short distance away.
We have no direct evidence
that such is the case, but, assuming it to be true, we might still
conceive how one low tone could be distinguished from another
by reason of its pattern of stimulation. The principle of gradients
could presumably apply to the situation represented in
Fig. 145, in which there is no genuine maximum, but in which
there is an abrupt change of gradient whose position depends
upon the frequency of the stimulating tone. The obvious
difficulty with this theory is that we have, as yet, no direct
evidence that the gradient of excitation by low tones changes
in the manner required. For that reason we should give due
consideration to other principles of explanation.

Fig. 144. The summated excitation due to the individual disturbances A
[...] saliences shown by the dotted curves.

It may simply be that, for the low tones, the general form
and extent of the pattern of excitation is enough to permit the
discrimination of frequency without appeal to the principle of
maximal stimulation, or to the principle of gradients. In other
words, it is possible that we are sensitive to the gross location of
a pattern, rather than to some specific feature of it.

Present evidence does not permit us to state definitely by what
aspect of basilar excitation one tone is distinguished from another
in pitch. Of this, however, we may be certain: the
position of basilar disturbance is related to the perceived pitch
of a tone. We have seen, furthermore, that there are open to
us certain reasonable possibilities for explaining the high resolving
power of the ear. Among these possibilities future experiments
may decide.

Fig. 145. A representation of the manner in which a change of
gradient in the pattern of excitation due to low tones may give
rise to perceptible saliences. Curve A would be for a lower frequency
than curve B. (Horizontal axis: basilar membrane.)



CHAPTER 16 

AUDITORY NERVE IMPULSES

From the point of view of the psychophysiology of auditory
sensation, the auditory nerve stands in the position of a gateway
between the sense-organ and the brain. The afferent impulses
which form the physiological basis for the whole range of
auditory sensation must pass through this single ‘bottle neck.’
The nerve is not scattered anatomically, like the nerves of the
cutaneous or the proprioceptive senses, and, theoretically, the
entire primary sensory input can be assessed and measured at
this point. Sound waves initiate movements in the structures
of the inner ear which we can study and analyze by means of
the aural microphonics, but it is only by way of the impulses
in the auditory nerve that the sense-organ can affect the higher
nervous centers and lead to the reactions which we call sensations.
The impulses which pass up the auditory nerve must
possess certain properties of number, distribution among the
various fibers, temporal sequence, etc., which are the correlates
of and the basis for the various psychological attributes of sound.
Knowledge of the neural correlates of the discriminable aspects
of sound has an additional general interest in that it may reveal
what types of relations among nerve impulses are capable of
central discrimination.

ANATOMICAL CONSIDERATIONS 

The distribution of the terminal branches of the auditory
nerve fibers to the hair cells and the arrangement of the cell
bodies of the primary sensory neurons in the spiral ganglion
of Corti have been described in Chapter 10. The axons of
the ganglion cells are gathered together within the modiolus
to form the auditory portion of the eighth cranial nerve. At
the point where this nerve passes through the petrous bone, it
is associated with the vestibular portion of the eighth nerve,
which serves the sense-organs of the semicircular canals, the
saccule, and the utricle. The entire nerve emerges through the
internal auditory meatus. The intracranial course of the eighth
nerve is short, not more than 4 or 5 mm in the cat, for the nerve
immediately enters the medulla in the region of the cochlear
nucleus and tuberculum acusticum (see Fig. 112, p. 270).

The fibers of the auditory nerve show a curious spiral
arrangement. Those fibers which connect the cochlear nucleus
with a region about one-quarter of the length of the basilar
membrane from the round window run straight and constitute
the axis of the nerve. Around these fibers the other fibers are
twisted: those going to the apex, in one direction; those going
to the basal region, in the opposite direction. Thus, the twisting
corresponds to the coiling of the cochlea, and the nerve as a
whole is twisted somewhat like a rope. The genesis of this
arrangement may be readily understood from the embryological
development of the cochlea, for the nerve fibers are dragged,
so to speak, after the organ of Corti as it grows out into its
final spiral form (Lorente de Nó, 2; Poljak, 1).

Upon entering the medulla, each fiber of the cochlear nerve
divides, as shown in Fig. 146, into an ascending branch which
goes to the ganglion ventrale and a descending branch which
enters the tuberculum acusticum. The ascending and descending
branches are arranged in parallel bundles. If the points of
division of the primary entering fibers are projected on a
longitudinal plane, they form a slightly curved line. This line is
the caudal boundary of the ganglion ventrale, but what is more
significant is that it represents a projection in the cochlear
nucleus of the organ of Corti. There, if we imagine the ganglion
of Corti to be uncoiled, the highest point in the line of
bifurcations of the nerve fibers corresponds to the apical end
of the ganglion and the lower end to the basal part of the
ganglion. We shall see later (p. 432) that a similar ‘map’ of
the organ of Corti has been discovered in the medial geniculate
body. This discovery is not surprising in view of the orderly
arrangement of the fibers in the tract connecting the cochlear
nucleus with the medial geniculate body. An equally orderly
arrangement of fibers in the auditory radiations which connect
the medial geniculate with the temporal cortex argues strongly
for a third projection of the organ of Corti in the cerebral cortex.

Fig. 146. Longitudinal section through the primary acoustic nuclei of a
four-day-old cat prepared by the method of Golgi.
G.v. = ganglion ventrale of the cochlear nucleus
p.n. = posterior nucleus
T.a. = tuberculum acusticum
I, II, and III indicate regions of specific structure within the tuberculum
acusticum and the ganglion ventrale.
C.f. = centrifugal fibers from higher auditory nuclei
N.c. = fibers of the cochlear auditory nerve with anterior branches (a) for
the ganglion ventrale and posterior branches (p) for the tuberculum acusticum
and posterior nucleus. The line of the bifurcation of these fibers represents
a map or projection of the cochlea in the cochlear nucleus. The fibers from
the apex of the cochlea branch near the center of the figure and those from
the central part of the cochlea near the bottom of the figure. (Lorente de
Nó, 2.)

ELECTRICAL DETECTION OF IMPULSES IN THE 
AUDITORY NERVE 

The action potentials of the auditory nerve may be recorded
by means of appropriate electrodes placed in the auditory nerve
or on the cochlea itself. Within the medulla the problem of
recording becomes more complex, for the primary neurons of
the auditory nerve stimulate secondary neurons in the cochlear
nucleus and, within the nucleus, both primary and secondary
impulses may be detected by the electrodes. Recording from
the cochlea yields action potentials as well as aural
microphonics, but the microphonics are the more powerful and
cannot be suppressed by any procedure which does not also abolish
the nerve impulses. Only when we make the microphonic very
brief, as in a faint click, so that the nerve impulses, by virtue
of their longer latency, appear after the microphonic, can we
record uncomplicated action potentials from the cochlea.

Ordinary wire or wick-electrodes applied directly to the
auditory nerve detect not only the action potentials but also the
aural microphonics, because the microphonics spread widely
from the cochlea, particularly along moist surfaces such as the
meninges of the brain. The microphonics may be excluded
by employing coaxial electrodes, which are made by passing a
fine insulated wire down the bore of a hypodermic needle. The
wire is held in place by insulating cement, and ground off flush
with the bevel of the needle. The wire serves as the ‘active’
electrode and is connected to the input grid of the amplifier,
while the needle itself acts as the ground or ‘reference’ electrode,
and also provides the necessary mechanical support. Electrodes
of this type are very useful for electrical exploration within the
substance of the brain, for they pierce the brain with relatively
slight trauma. The radius within which electrical effects are
picked up by coaxial electrodes is quite small, so that it is
possible to determine with fair accuracy the location of the active
nerve fibers responsible for the potentials. Moving the needle
a millimeter or less may reduce the electrical record of impulses
in a particular tract to as little as one tenth of its former size.

CHARACTERISTICS OF SIMPLE AUDITORY 
ACTION POTENTIALS 

The simplest auditory action potential would consist of a
single nerve impulse in a single auditory fiber. This degree of
simplification has been reasonably approximated by using as a
stimulus a click whose strength is just above threshold. With
this stimulus, very few fibers are simultaneously stimulated and
only one impulse is initiated in each. Such a volley may be
recorded either at the round window or in the auditory nerve
(Fig. 147).

Duration. The duration of the action potential in a fiber of
the auditory nerve is less than 1 msec. In both duration and
wave form it resembles the action potentials of other medullated
fibers of similar diameter.

Velocity of Conduction. The time interval between the
appearance of the action potential in the cat's cochlea and in the
auditory nerve is about 0.15 msec. The distance from the basilar
membrane to the usual position of the electrodes in the nerve
is about 4 mm. The velocity of conduction is therefore about
30 meters per second. This is similar to the velocity of
impulses in cutaneous sensory nerves and in the optic nerve.
The figure is only approximate, because the length of fiber
traversed is not accurately known, and also because it is assumed
that the action potential is registered at the round window as the
impulse crosses the basilar membrane and before it enters the
bony structure of the modiolus. The assumption seems
reasonable, however, in view of the good insulating properties of the
petrous bone.
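
The arithmetic behind the velocity estimate can be written out as a
short sketch. Python is used here purely for illustration, and the
function and variable names are hypothetical. The only inputs are the
two approximate figures quoted above (4 mm and 0.15 msec), so the
result shares their uncertainty.

```python
# Illustrative sketch only: the names are hypothetical, and the two
# input figures are the approximate values quoted in the text.
distance_mm = 4.0   # basilar membrane to electrode position in the nerve
delay_msec = 0.15   # interval between cochlear and nerve action potentials


def conduction_velocity_m_per_s(distance_mm, delay_msec):
    """Return velocity in meters per second.

    One millimeter per millisecond equals one meter per second,
    so no further unit conversion is needed.
    """
    return distance_mm / delay_msec


velocity = conduction_velocity_m_per_s(distance_mm, delay_msec)
print(f"{velocity:.1f} m/s")  # prints "26.7 m/s"
```

The quotient is nearer 27 than 30 meters per second. The rounded
figure of "about 30" is consistent with the stated uncertainty in the
length of fiber traversed.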

These measurements point clearly to the physiological
similarity of the auditory nerve and other sensory nerves, and they
render improbable any assumption that the fibers of the auditory
nerve possess special properties which would provide them with
an unusually brief refractory period.

Polarity. The polarity of the action potentials from the
cochlea or from the auditory nerve is constant. For example, an
electrode on the round window always records a large initial
negative peak which may or may not be followed by secondary
waves. The first action potential wave is always negative,
whether the initial wave of the cochlear microphonic preceding
it is negative or positive.* The electrical sign of the first
microphonic wave depends upon whether the first sound wave is one
of negative or of positive pressure (see p. 318). When the
sound is a click generated by discharging a condenser through
a loud speaker, the direction of the initial pressure wave and
the polarity of the initial microphonic wave may be reversed by
reversing the electrical polarity of the condenser-discharge.
The ensuing action potential, however, remains negative (Fig.
147 B), just as the action potential in any other nerve remains
negative regardless of the nature or polarity of the stimulus
which initiates the impulse. This uniform polarity of the
action potential provides a ready means for distinguishing the
two components in the electrical pattern at the round window
following an impulsive stimulus, and it attests the fundamental
difference in the origins of the action potentials and the cochlear
microphonics.

Latency. The action potential produced by a click is not
simultaneous with the first wave of the cochlear microphonic.
The action potential recorded from the round window follows
the microphonic by at least 0.55 msec, and sometimes by as much
as 2.0 msec, depending upon the polarity and intensity of the
stimulus. These relations and their implications will be
considered later in detail.

Vulnerability. In the event of death to the experimental
animal, or the interruption of the oxygen supply to the cochlea,
the action potentials disappear before the microphonics,
although the microphonics fall to a fraction of their original
intensity at the death of the animal. The action potentials in
the higher auditory pathways, beyond the synapses of the
cochlear nucleus, are even more vulnerable than those of the
auditory nerve, and they may be abolished by deep surgical
anesthesia.

* The polarity and form of action potentials recorded by coaxial electrodes
placed in the auditory nerve or brain depend upon the position of the
electrodes relative to the active fibers. The form is rarely monophasic, sometimes
diphasic, and often triphasic. This complexity is due in part to the geometry
of the electrodes and in part to the complex conducting and shunting effects
of the surrounding tissue.

Fig. 147. Standing wave oscillograms of the cochlear microphonics and
action potentials in response to clicks, recorded from the round window and
from the auditory nerve. The records from the nerve were obtained with
coaxial electrodes and show action potentials without microphonics. The
response consists of three, and perhaps four, fairly discrete waves. These same
waves appear in the round window records, but there they are superimposed on
the later waves of the cochlear microphonics.
In B the stimulus is the same as in A except that its polarity has been reversed.
Note that the first wave of the cochlear microphonic is inverted in B but that
the action potentials retain the same polarity in B as in A.
In C both records are from the round window. The first corresponds to B
except that the stimulus is stronger. In the second a hissing sound is delivered
to the cat's ear in addition to the clicks. The base line is broader, due to the
microphonics and action potentials of the hiss, which are not synchronized with
the sweep of the oscillograph. The cochlear microphonics of the click are
unaffected by the hiss, but the action potentials are greatly reduced or ‘masked.’
(Derbyshire and Davis, 2.)

Threshold. The threshold intensity required to initiate an
observable action potential depends upon two factors: the
thresholds of the individual nerve-fibers, and the number of
fibers which must be simultaneously excited in order to generate
a detectable potential across the electrodes. The exact
placement of the electrodes in relation to the active fibers is critical
in determining whether or not the impulses from a few fibers
will be detected. A fortunate placement makes it possible to
detect simultaneous activity in a very few fibers, perhaps five
or six, but there is as yet no clear evidence for the successful
detection of impulses in a single auditory fiber. Since the
nerve-impulses are all-or-none in character (see Chapter 12),
the problem of their detection hinges upon the number of fibers
stimulated and upon the placement of the electrodes, and not
upon any increase in the size of the individual impulses.

With good placement of the electrodes in the auditory nerve,
the threshold for action potentials in response to clicks (or low
tones) is within a few decibels of the threshold for the aural
microphonics at the round window. This correspondence is
convenient, but presumably fortuitous, and it has no theoretical
significance, except as it justifies the practical use of the
threshold for the microphonics as an experimental measure of the
sensitivity of the ear. In some experiments the threshold for
action potentials in the higher auditory pathways of the cat
has been found to be actually lower than the threshold for the
corresponding aural microphonics, and quite as low as the
average threshold for human hearing (Kemp, Coppee, and
Robinson). The microphonic threshold, it will be recalled, is
defined as an arbitrary (just detectable) potential (see p. 313).

Relation to Sound-Intensity. The intensity of short
impulsive stimuli affects both the amplitude and the latency of the
resulting action potentials.

Amplitude. As the intensity of a click is increased, both
the cochlear microphonic and the action potential grow larger.
Near threshold, the increases can be measured satisfactorily
from the round window (see Fig. 148 A) but, at 30 or 40 db
above threshold, measurement becomes difficult, because the
mechanical response of the ear to a click is not critically damped
(see p. 262) and the action potentials are superimposed on the
later waves of the microphonics. The degree of complexity
which this introduces depends in part on the sharpness of the
click and the shape of the resulting microphonic pattern.
Precise measurements are therefore impossible at high
sound-intensities, but it can be seen that the action potential continues
to increase, as well as the cochlear microphonic, although they
do not necessarily follow the same law of increase.

Fig. 148. The amplitude of the first wave of the action potential as a
function of the intensity of a stimulating click.
A. Recorded from the round window of a cat. The growth of the action
potential differs strikingly from the growth of the cochlear microphonic.
B. Recorded by coaxial electrodes from the auditory nerve of the same
animal. This curve differs from both of the curves in A. The latency of the
action potential of the nerve was 1 msec longer than that of the first
action-potential wave at the round window. The unusually long difference in latency
and the different shapes of the amplitude curves show that the action potentials
of A and B probably do not belong to the same nerve-fibers. This fact
illustrates the difficulty in drawing conclusions from measurements of a random
sample of neural activity detected by coaxial electrodes. (After Derbyshire
and Davis, 2.)

Growth in the size of the action potential implies increase
in the number of fibers stimulated, but it is entirely uncertain
how linearly the size of the action potential registers this
numerical increase, for all fibers are not located in equally
advantageous positions with respect to the recording electrodes.
This difficulty is particularly serious when coaxial electrodes are
inserted into the auditory nerve or higher tracts (see p. 422).
Therefore, the law of growth of the action potentials remains
uncertain, and we cannot make significant quantitative
comparison between it and psychological data. Furthermore, as
will appear below, the action potentials in response to a click
do not represent a single synchronized volley of impulses but
a series of volleys of various latencies, originating at various
positions along the basilar membrane. The data of Fig. 148 are
based on the measurement of only one, the earliest, of these
volleys.

Latency. The latency of the nerve-impulses diminishes
with increase of intensity of the stimulating click. This change
can be accurately measured for the first volley of impulses and
it appears to characterize the later volleys as well. In the
particular case illustrated in Fig. 149 the latency diminished from
0.84 msec to 0.53 msec as the stimulus was increased from
threshold to a level of 30 db above threshold. Further increase in
intensity apparently does not further diminish the latency of
the nerve-impulses.

Fig. 149. Latency of the first wave of the action potential at the round
window in relation to the intensity of the stimulating click (abscissa:
decibels above threshold). The latency was measured from the first major
negative peak of the cochlear microphonic to the foot of the first wave of
the action potential. The latency diminished by 0.29 msec as the intensity
was increased to 30 db, but at greater intensity it remained constant. The
first half-cycle of the cochlear microphonic occupied 0.26 msec. (Derbyshire
and Davis, 2.)





THE RESPONSE TO STRONG IMPULSIVE STIMULI

The action potential wave in response to a single click is not
simple except near threshold. When the intensity of the
stimulus is increased by a few decibels, additional action potential
waves appear in the auditory nerve, giving the composite wave
illustrated in Figs. 147 and 150. The group is usually
composed of three major waves which differ in latency, maximal
amplitude, and threshold. These waves merge more or less into
one another. The pattern differs in detail, according to the
frequency spectrum of the click, but the impulses are
considerably dispersed in time, except when a stimulus very near
threshold is employed. The earliest wave is sharp and
prominent when the click is generated by passing a
condenser-discharge through a loud speaker designed for high tones (3000 to
10,000 cycles). The early wave is less prominent when a loud
speaker designed for a lower range (60 to 4000 cycles) is used.
This difference in pattern of the neural response is presumably
due to contributions from different parts of the organ of Corti
and constitutes the basis of the ability of the human ear to
distinguish the clicks from one another by their tonal quality, even
at an intensity so low that no nerve fiber carries more than one
impulse in response to each click (cf. Fig. 117, p. 283).

The three waves shown in Figs. 147 and 150 have been
arbitrarily designated F, G, and H in order of increasing latency.
The figures for latency given on page 385 refer entirely to the
earliest, F, wave. The latencies of the later waves can be
measured only to the peak, since the foot of each overlaps the previous
wave. Representative figures for the latencies measured from
the first negative peak of the aural microphonic to the peaks of
the three waves recorded from the round window are: F, 0.7
to 1.0 msec; G, 1.9 msec; and H, 2.6 msec. The latencies of G
and H shorten like that of F, and to the same degree, with
increase of the intensity of the stimulus.

The threshold for G and H is usually about 10 db below
that for the aural microphonic when the ‘high frequency’
speaker is the source of the click, and when the potential is
recorded from the round window. The F wave and the cochlear
microphonic first appear at about the same intensity of
stimulation. The F, G, and H waves all reach their maximal
amplitudes at 30 to 40 db above their respective thresholds,
although with further increase in intensity the cochlear
microphonic continues to increase.

Somewhat similar waves are seen in the action-potential
records obtained from mixed peripheral nerves which have been
stimulated electrically (Erlanger and Gasser). The waves in
peripheral nerves are due to differences in the velocity of
conduction in different groups of nerve-fibers. The slower impulses lag
behind the faster ones and appear as later waves in the
composite action potential. The differences in velocity are correlated
with differences in the diameters of the nerve-fibers. Large
fibers conduct impulses more rapidly, small fibers more slowly.
The fibers of the auditory nerve, however, are quite uniform in
diameter, about 5 microns (Lorente de Nó, 2). We should
therefore expect uniform velocities of conduction, and,
consequently, we must seek another explanation for the occurrence
of separate waves in the click response in the auditory nerve.

Fig. 150. Standing wave oscillograms of action potentials in response to
clicks, recorded from the lateral lemniscus of a cat. (Records shown:
unmasked, and masked by tones of 2400, 1500, 700, and 350 cycles.) These
oscillograms show preferential masking of different components by tones of
different frequencies. High tones reduce the earlier and low tones the later
components. The F, G, and H waves are indicated in the unmasked record. (The
swing below the baseline after the H wave is an artefact introduced by the
small coupling condensers which were employed in order to stabilize the
amplifier for clear photography of the standing waves.) (After Kemp, Coppee,
and Robinson.)
The F, G, and H waves apparently represent volleys of
impulses in different groups of fibers in the auditory nerve, fibers
which innervate different portions of the basilar membrane.
The evidence for this statement depends upon the suppression or
‘masking’ of one or another of the waves when tones of various
frequencies are sounded at the same time as the clicks. The
phenomenon of masking will be considered in the next chapter,
but for the present we may interpret the fact of such interaction
between a particular tone and a click as evidence that both
stimuli are activating the same portion of the basilar membrane.
The earliest (F) wave is preferentially masked by high tones,
and the latest (H) wave by low tones. In a typical case
illustrated by Fig. 150 the F wave was selectively masked by a tone
of 2400 cycles. A tone of 1500 cycles caused greatest depression
of the middle (G) portion of the response, while 350 cycles
was particularly effective in masking the late (H) portion.
Complete masking of any wave is difficult to attain with a pure
tone, since the click response will mask the tonal wave itself
if it falls in the proper phase of the tone (see p. 409). With
strong masking tones of any frequency all waves are somewhat
reduced, but the superior effectiveness of certain frequencies is
very clear. Since the F wave is selectively masked by a tone
of 2400 cycles, we may infer that it is composed of nerve
impulses traveling in the nerve fibers which arise in the portion of
the basilar membrane which is tuned to 2400 cycles. This
portion of the membrane must, therefore, be stimulated by the click
before the portion tuned to the lower frequencies which give
rise to the G and H waves. We have already seen, in Chapter
10, how this temporal dispersion of the nerve impulses in
response to a click supports the concept of traveling waves on
the basilar membrane.



THE STIMULATING PHASE OF THE COCHLEAR 
MICROPHONIC 

The latency of the first volley of nerve impulses, measured
from the start of the cochlear microphonic, varies with the
polarity of the sound waves. For a given intensity, however,
the latency of the first action potential is constant when
measured from the negative peak of the large initial cycle of the
cochlear microphonic (see Table V). We may conclude,
therefore, that the nerve impulses are initiated during only one phase
of the cochlear microphonic, the one in which the round
window is passing from electrical negativity to electrical positivity.
This particular phase of the microphonic is associated with
outward movement of the stapes and the tympanic membrane
(cf. Fig. 133, p. 342). If the first major electrical change at the
round window is from positive to negative, no impulses are set
up during this phase. They are delayed until the
negative-to-positive phase of the cochlear microphonic occurs. It is
significant that, in the cases so far studied, the decrease in latency with
increase of intensity is quantitatively nearly equal to the
duration of this negative-to-positive phase of the first major wave
of the cochlear microphonic. We may suppose that at
threshold the necessary condition for stimulation is achieved only near
the end of the negative-to-positive phase, but with great sound
intensity it occurs very near the beginning.

TABLE V

Polarity of first large     Latency of F wave of action potential
wave of the cochlear        measured from
microphonic                 Negative peak        Positive peak
                            of microphonic       of microphonic

Positive                    0.70 msec            1.02 msec
Negative                    0.70 msec            0.55 msec
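
The point made by Table V can be checked with a few lines of
arithmetic. In the sketch below (Python, chosen only for illustration;
the dictionary layout and the helper name are hypothetical) the four
latencies are those given in Table V. The spread of the latency across
the two stimulus polarities is computed for each reference point.

```python
# Illustrative sketch: the data structure and function name are
# assumptions; the four latencies (in msec) are those of Table V.
table_v = {
    "positive": {"from_negative_peak": 0.70, "from_positive_peak": 1.02},
    "negative": {"from_negative_peak": 0.70, "from_positive_peak": 0.55},
}


def latency_spread(reference):
    """Difference (msec) between the largest and smallest latency
    of the F wave across polarities, for one reference point."""
    values = [row[reference] for row in table_v.values()]
    return max(values) - min(values)


spread_negative = latency_spread("from_negative_peak")
spread_positive = latency_spread("from_positive_peak")
print(f"measured from negative peak: {spread_negative:.2f} msec")
print(f"measured from positive peak: {spread_positive:.2f} msec")
```

Measured from the negative peak, the latency does not change when
the polarity is reversed. Measured from the positive peak, it shifts
by nearly half a millisecond. This is the basis for taking the
negative peak as the reference point.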








THE MECHANISM OF STIMULATION OF THE
AUDITORY NERVE

The association of nerve stimulation with a particular phase
of the cochlear microphonic does not imply that the electric
potential itself stimulates the nerve, although this hypothesis
has been proposed. The hypothesis of electrical stimulation of
the nerve is attractive for its simplicity, but it encounters one
serious objection. It does not adequately explain the long
latency of 0.6 msec or more exhibited by the auditory nerve
impulses. The shortest latency of the nerve impulses which
has been measured, even with maximal stimulation, is 0.53
msec (Derbyshire and Davis, 1, 2). On the hypothesis of
electrical stimulation, the latency must be explained by (a)
utilization time of the stimulus acting on the nerve fibers and
(b) slow conduction in the nonmedullated terminal twigs of
the nerve fibers. But, if the properties of the auditory nerve
fibers are similar to those of other nerves, the utilization time
for maximal stimuli should be not more than 0.1 msec at the
most. If we ascribe 0.1 msec of the delay to utilization time
and the remainder to conduction time, we must assume a rate of
conduction of less than 10 cm per second for the nonmedullated
terminal twigs, because the velocity in the medullated portion
is about 30 meters per second and the length of the shortest
nonmedullated twigs from the internal hair-cells to the beginning
of medullation is only 30 microns (Lorente de Nó, 2). Such
unusually slow conduction is a very difficult assumption to
accept, particularly when we realize that the refractory period
of the terminal twigs is as brief as that of large medullated fibers
themselves (see p. 401).
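
The rate of conduction implied by these figures can be worked out
explicitly. The following sketch (Python, used only for illustration;
all names are hypothetical) takes the shortest measured latency of
0.53 msec, the assumed utilization time of 0.1 msec, and the twig
length of 30 microns from the text. It assumes, as the argument above
does, that the whole of the remaining delay is spent in conduction
along the nonmedullated twig.

```python
# Illustrative sketch: names are hypothetical; the figures are those
# quoted in the text. The whole remaining delay is ascribed to
# conduction in the nonmedullated twig, as in the argument above.
shortest_latency_msec = 0.53   # shortest measured latency
utilization_msec = 0.1         # upper limit allowed for utilization time
twig_length_microns = 30.0     # shortest nonmedullated terminal twig
medullated_m_per_s = 30.0      # velocity in the medullated portion

conduction_time_msec = shortest_latency_msec - utilization_msec

# One micron per millisecond equals one millimeter per second;
# dividing by 10 converts millimeters per second to centimeters per second.
twig_rate_cm_per_s = (twig_length_microns / conduction_time_msec) / 10.0

# Ratio of the medullated velocity (converted to cm per second)
# to the implied twig velocity.
slowdown = (medullated_m_per_s * 100.0) / twig_rate_cm_per_s

print(f"implied twig velocity: {twig_rate_cm_per_s:.1f} cm per second")
print(f"slower than medullated portion by a factor of about {slowdown:.0f}")
```

The implied rate is about 7 cm per second, which is indeed less than
10 cm per second and some 430 times slower than conduction in the
medullated portion of the same fiber.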

Another difficulty with the hypothesis of electrical
stimulation lies in the effectiveness with which the higher audible
frequencies can stimulate the nerve fibers. Other nerve fibers
stimulated by alternating currents of 1000 cycles or more show a
very high threshold, and the threshold rapidly rises with
continued stimulation. Auditory fatigue, whether measured
subjectively (p. 217) or by action potentials (p. 397), does not vary
with frequency to anything like the extent demanded by a
simple theory of electrical stimulation.




A more promising hypothesis is that stimulation of the nerve
fibers results from the formation or liberation of a chemical
mediator (Derbyshire and Davis, 1), as a direct result of the
mechanical deformation of the hair cells. The most obvious
advantage of this hypothesis is that the latency of the
nerve-impulses is easily explained by the theory of chemical mediation.
The latency represents the time required for diffusion of a
chemical substance to its point of action upon the nerve fibers
and for the subsequent stimulation of these fibers. No
suggestion as to the nature of the hypothetical mediator has been
offered, but the hypothesis is intended to be analogous to the
theory of chemical mediation now generally accepted for
neuromuscular, for neuroglandular, and for certain instances of
synaptic transmission (Cannon and Rosenblueth).

Chemical transmission or mediation of nervous effects was
originally demonstrated by the following experiment by Loewi.
Two isolated frog hearts, beating spontaneously, were arranged
so that the fluid which perfused the first was collected and used
to perfuse the second also. The vagus nerve to the first heart
was stimulated. The contractions of the first heart became
weaker and its rhythm slower, which is the usual effect of
nerve impulses reaching a heart by way of the vagus nerve.
But the rhythm of the second heart was also slowed, and the
contractions were weakened. No nerve impulses reached the
second heart, but effects corresponding to vagal nerve impulses
were transmitted to it from the first heart by way of the
perfusing fluid.

A chemical substance or ‘mediator’ is liberated at the
terminations of many nerve fibers when impulses reach them.
For the vagus nerve, the mediator has been identified as
acetylcholine, an unstable ester of choline. Acetylcholine is
liberated at all the nerve terminations belonging to the cranial
and sacral divisions of the autonomic nervous system. It is also
liberated by the motor fibers which innervate skeletal muscles.
Acetylcholine also mediates the transmission of activity from
one neuron to another across the synaptic junctions of the
superior cervical ganglion, and probably across many other
synapses in the nervous system. When acetylcholine is injected
into the blood stream, it is hydrolyzed with great rapidity by
an enzyme which is normally present in the blood and tissues.
This very rapid destruction prevents the acetylcholine liberated
by a nerve fiber from exerting its effects in other organs than
the one in which it is liberated, a fortunate provision indeed.

Acetylcholine is not the only chemical mediator of nervous
effects. At the terminations of the fibers of the sympathetic
division of the autonomic nervous system, in the heart, in the
muscular walls of the blood vessels, in the intestine, and in many
other organs, the mediator is sympathin. The chemical
structure of sympathin has not been established, but it is known to
be closely allied to adrenin, the hormone secreted by the medulla
of the adrenal glands. In fact, adrenin itself may properly be
regarded as a chemical mediator of nervous effects whose action
is much more diffuse and persistent than that of acetylcholine.
Several other mediators exist and control various functions
such as the activity of chromatophores in fish and amphibia
(Parker).

The existence of not only one but of several chemical
mediators is firmly established. Chemical mediation is so
widespread, indeed, that we may well suppose it is the general
mechanism by which activity in one cell initiates activity in another
cell, just as the electrochemical mechanism of nerve-conduction,
outlined in Chapter 11, is the general type of all-or-none
conduction within single nerve cells. If chemical mediation is the
fundamental principle of intercellular transmission, then it is
quite reasonable to suppose that sensory cells, such as the hair
cells of the organ of Corti, should stimulate the sensory nerve
fibers by means of a chemical mediator. The mediator may
or may not be acetylcholine, but it is worth noting that the very
rapid destruction of acetylcholine by its esterase (Marnay and
Nachmansohn) would make it possible for the substance to
appear and disappear with the flash-like rapidity needed to
generate nerve impulses at frequencies up to 1000 per second.



CHAPTER 17 


NERVE IMPULSES IN RESPONSE TO 
TONAL STIMULATION 

In the previous chapter we have considered the fundamental
properties and relations of nerve-impulses in the auditory nerve.
These properties are most clearly revealed by the relatively
simple volleys of impulses initiated by single clicks. We have
seen that the impulses are in all respects similar to the impulses
in other sensory nerves. The all-or-none impulse is the
physiological unit of nervous activity, and the response of the
auditory nerve to tonal stimulation must consist of a series of
such impulses.


SYNCHRONIZED ACTION POTENTIALS 

From the point of view of the auditory nerve, a tonal stimulus
consists of a series of discrete stimuli. Each sound wave is
itself a separate stimulus. Therefore, the general laws governing
the stimulation of nerve fibers by repetitive stimuli have
here an immediate application. Particularly important is the
limitation set by the refractory phase upon the frequency at
which a fiber may be made to respond to repeated stimulation.
It is a necessary consequence of the all-or-none law that the
impulses in a single nerve fiber are unable to keep pace with the
frequency of a sound when the frequency exceeds a certain
value. At frequencies above the critical value the total response
of a nerve containing many fibers may still appear to follow the
frequency of the stimulus, but under these conditions each
individual fiber does not respond to every sound wave, but only
to every second, third, or fourth wave, according as its refractory
period may determine (see p. 401). The nerve-impulses group
themselves in relation to the sound waves, because the impulses
are set up in a definite phase of the cochlear microphonic and
hence in a definite phase relation with the stimulating tone.

In response to tones below 400 cycles, the response of the 
auditory nerve is a series of volleys of nerve-impulses, and 
the impulses in each volley are so nearly simultaneous that they 
are clearly separated from one another by a temporal interval, 
as shown by the straight oscillographic baseline in Fig. 151. 
The impulses in each volley are, however, slightly dispersed in 
time, just as we should expect from our knowledge of the mode 
of vibration of the basilar membrane (cf. Fig 118, p. 285). 
The temporal dispersion serves to give the composite waves a
slightly broader and more rounded contour than that of single
action-potentials. As the frequency of the stimulating tone is
increased, the volleys occur closer and closer to one another;
and, when the interval between the volleys is finally reduced to
zero and the volleys begin to overlap, the total pattern becomes
more and more sinusoidal (Fig. 151).

Fig. 151. Oscillograms of action-potentials in response to steady
tones, compared with the response of a microphone to the same
tones. The traces are labeled microphone, nerve, and tract. The
action-potentials from the lateral lemniscus of the cat in response
to a 256-cycle tone are a series of (downward) ‘spikes’ separated by
nearly flat baseline. At 1024 cycles the spikes merge into a nearly
sinusoidal wave. (After Hallpike, Hartridge, and Rawdon-Smith. By
permission of The Royal Society of London.)

The action-potentials of the auditory nerve may follow the
frequency of the stimulating tone up to approximately 3000
cycles. The reproduction by the action-potentials of the
frequency of the stimulating tone constitutes the original
“Wever-Bray effect,” although this term is now more often
applied to the aural microphonics or to an indiscriminate mixture
of microphonics and action-potentials. It was probably an
unsuspected admixture of aural microphonics in the original
experiments which led Wever and Bray to place the frequency-limits
for “auditory nerve-impulses” as high as 4100 cycles. When
coaxial electrodes are employed to shield against the aural
microphonics, the upper limit of frequency for synchronized
action-potentials of the cat may be as high as 4000 cycles; but
nearly maximal stimulation is required to obtain a synchronized
response between 3000 and 4000 cycles, and the response may
rapidly lose its synchronized character with continued
stimulation.

Relation to Intensity. The size of the recorded action-potentials
of the auditory nerve increases to a definite maximum with
the intensity of the stimulating tone. This maximum, beyond
which no further increase can be observed, recalls the similar
maximum exhibited by the aural microphonics, and it is reached
at approximately the same intensity of stimulation. The maximal
response is approximately constant for all frequencies in
the range below 900 cycles, although the presence of higher
harmonics makes accurate measurements difficult for tones below
400 cycles.

Fig. 152. The initial size of the action-potential in the auditory
nerve of a cat as a function of frequency. The sudden drops in the
curve are due to the fact that above a critical frequency (800
cycles in this experiment) the individual fibers cannot follow the
frequency of the stimulus, but must alternately respond to every
other vibration. At twice this critical frequency each fiber
responds only to every third vibration. (Stevens and Davis.)

Maximal Amplitude as a Function of Frequency. Figure
152 shows the relation of the maximal amplitude of the
action-potentials to frequency. As the frequency is increased, a
critical value is reached, usually at about 900 cycles, at which the
maximal amplitude falls more or less abruptly to a little less
than half its previous value. It remains at this level until a
second critical frequency, equal to twice the first, is reached.
A new level is then established at less than one third the original
amplitude. At three times the first critical value, 2700 cycles
in this instance, there is another fall in amplitude. The amplitude
here is too small for accurate measurement, although the
waves may still be seen on a cathode ray oscillograph. Between
3500 and 4000 cycles the synchronized response is lost entirely.
Impulses ascend the nerve when a high tone stimulates the ear,
but the impulses are not synchronized with one another, or with
the stimulating tone. The discharge is then random, or
asynchronous, like that in a cutaneous sensory nerve when many
sense-organs are stimulated by a stroking of the skin. This
type of discharge appears on an oscillograph as a roughened
baseline. In a loud-speaker the response is a rustle, or low hiss,
of rather characteristic quality, and, although its measurement is
difficult, it may be used for determination of thresholds.

Fig. 153. The onset of a 310-cycle tone produces an on-effect in
the auditory nerve, followed by synchronized action-potentials
which show moderate equilibration. The 570-cycle tone produces an
on-effect followed by synchronized action-potentials which show
marked equilibration. It is unusual to find so much equilibration
at a frequency as low as 570 cycles. (After Derbyshire and
Davis, 2.)

FATIGUE AND EQUILIBRATION 

The foregoing description of the behavior of action-potentials
at various frequencies is based on measurements made
immediately after the on-effect of the action-potentials evoked by
a tone 70 db above threshold. The on-effect of the action-potentials
consists of a large initial volley of nerve-impulses. Following
the on-effect the action-potentials do not remain constant in
size, as do the aural microphonics, but shrink, first rapidly and
then more slowly, to a lower amplitude (Fig. 153). The rate
and extent of this shrinkage are definite functions of the
frequency of stimulation (cf. p. 301). The reduction in the initial
size of the potential presumably represents a readjustment of
the chemical dynamics in the nerve fiber and the attainment of
a new equilibrium between anabolism and catabolism. For
this reason the readjustment has been termed equilibration. It
is a special case of fatigue, in that it represents an adjustment to
a new level of sustained activity and not an exhaustion of reserve
material. Recovery is complete within 30 sec after the end of
a period of stimulation during which equilibration has occurred.

Fig. 154. The amplitude of the action-potential of the auditory
nerve during and after continuous stimulation at 1500 cycles,
showing slow equilibration and recovery. The initial phase of fast
equilibration (see Fig. 153) occurred too rapidly to be plotted on
this time scale. During recovery the stimulating tone was sounded
for about half a second every five seconds and the initial size of
the action-potentials recorded. (Derbyshire.)

Equilibration is not to be confused with the hysteresis of
the cochlear microphonics (see p. 326), for equilibration occurs
with submaximal as well as with supramaximal stimulation,
and is a function of the frequency, but not of the intensity,
of stimulation. The shrinkage continues, as shown in Fig. 154,
for 5 to 7 min after the onset of a stimulating tone, although it
is much greater and more rapid during the first 2 sec than
during any later period. An arbitrary measure of the degree
of equilibration may conveniently be made after 2 sec, in which
event the initial value, with respect to which the equilibration
is measured, should be determined immediately after the
on-effect has subsided.

MAXIMAL FREQUENCY OF IMPULSES 
IN EACH FIBER 

The general principle is well established that the degree and
the rate of equilibration of nerve-impulses are greater, the
higher the frequency of impulses in a nerve fiber. This fact
enables us to determine with assurance the maximal frequency
of impulses in the individual fibers of the auditory nerve by
determining the degree of equilibration as a function of the
frequency of the stimulus. Below 400 cycles, equilibration is
slight. It becomes greater when the frequency is increased,
and it reaches a maximum (where the shrinkage is greatest) at
900 to 1000 cycles. It is upon passing through this same
frequency range of 900 to 1000 cycles that the size of the initial
response falls sharply to about one half of its former value.
The equilibration passes through a second maximum a little
below 2000 cycles, where the initial response falls to its third
plateau. Another maximum in equilibration can usually be
seen to occur near 3000 cycles, but the size of the synchronized
action-potential is here so small that satisfactory measurements
are impossible.

The appearance of maxima of equilibration at three different
frequencies, one of which is double and another treble the
lowest, identifies the lowest of the three (about 900 cycles) as the
maximal frequency of impulses in each fiber. This maximal
frequency is imposed by the refractory period of the nerve fibers.
Above 900 cycles each fiber responds only to alternate sound
waves, so that when the frequency of the tone is 1400 cycles
each nerve fiber is presumably carrying 700 impulses per second.
This behavior of the nerve fibers is termed alternation. Above
the second critical frequency of 1800 cycles, each fiber responds
to only one sound wave in three, and the behavior can then be
described as rotation. The reduction of the initial response
(after the on-effect) to approximately one half and then to
approximately one third of its original size is easily understood
on the basis of alternation and rotation, for, when each fiber
responds only to alternate sound waves, the total number of
fibers active at any one time is only half as great as when each
fiber responds to every wave.
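
The arithmetic of alternation and rotation may be made concrete
with a short sketch in Python. It is an idealization only: the
function name fiber_response, the ceiling of 900 impulses per
second taken as a default, and the supposition that the recorded
amplitude is exactly proportional to the number of fibers active
on each wave are assumptions introduced for illustration, not
measured quantities.

```python
import math


def fiber_response(tone_hz, max_rate_hz=900.0):
    """Idealized alternation/rotation for a single fiber.

    Returns (n, rate, relative_amplitude), where the fiber answers
    every n-th sound wave, `rate` is its impulse frequency, and
    `relative_amplitude` is the fraction of fibers active per wave
    (assumed here to be exactly 1/n).
    """
    if tone_hz <= 0 or max_rate_hz <= 0:
        raise ValueError("frequencies must be positive")
    # Smallest whole number of waves per impulse that keeps the
    # fiber at or below its maximal frequency.
    n = max(1, math.ceil(tone_hz / max_rate_hz))
    rate = tone_hz / n
    return n, rate, 1.0 / n


if __name__ == "__main__":
    for tone in (500, 900, 1400, 2000):
        n, rate, amp = fiber_response(tone)
        print(f"{tone} cycles: one impulse per {n} wave(s), "
              f"{rate:.0f} impulses per second, "
              f"relative amplitude {amp:.2f}")
```

With the default ceiling, a 1400-cycle tone gives n = 2 and 700
impulses per second in each fiber, as in the text; the measured
plateaus fall somewhat below the ideal fractions 1/2 and 1/3.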

The phenomenon of alternation is not confined to the
mammalian auditory nerve. Alternation, rotation, and equilibration
have all been demonstrated in the cercal nerve of the cricket
(Pumphrey and Rawdon-Smith). In this nerve the critical
frequency is about 400 cycles instead of 900 cycles.

When we make actual measurements on a cathode ray
oscillograph of maximal amplitude and of equilibration as
functions of frequency, we do not always encounter abrupt changes
at the critical frequencies, although some change is usually
evident. The critical points become less and less definite as the
experimental preparation ages. After periods of exposure by
operation, under anesthesia, the individual fibers tend to exhibit
lower critical frequencies, and the total picture becomes
increasingly blurred. Under ideal conditions, however, we may
observe effects which approximate those pictured schematically
in Fig. 155. These curves are those which, theoretically, should
be obtained from an ideally fresh and uniform nerve. The
upper curve represents the maximal amplitude of the observed
action-potential. It falls abruptly from 12 units to 6 at 1000
cycles. At 2000 cycles it falls to 4 units, and at 3000 cycles to 3
units. The lower curve represents the percentage of the initial
voltage of the action-potential which remains after 2 sec of
equilibration. It starts with a value of 80 per cent at 500 cycles
and falls more and more rapidly until it reaches 30 per cent at
1000 cycles. It then rises abruptly to 80 per cent, which is the
value it had at 500 cycles. The voltage remaining after
equilibration falls again to 30 per cent at 2000 cycles, and then rises
abruptly once more to 80 per cent. This sequence is repeated
at the higher critical frequencies.

Fig. 155. Schematic diagram showing the initial voltage of
action-potentials and the course of equilibration, as functions of
frequency, in an ideal nerve in which the refractory period of all
the fibers is 1 msec. (See text for explanation.)

The simultaneous decrease, at 1000 cycles, in equilibration
and in the amplitude of the total action-potential indicates that,
in each fiber, the frequency of impulses has fallen to 500 per
second, or to one half that of the stimulating tone. Each fiber
then responds only to alternate sound waves. At the higher
critical frequencies the fibers resort to rotation, and then only
a fraction of the total number of responding fibers is activated
by any single sound wave. Under these conditions, we measure
an action-potential which is only a fraction of the size it would
exhibit in response to a low tone. It should be noted that the
fractional reduction in the observed action-potential will equal
the fractional reduction in the number of active fibers only
when there is a direct proportionality between the number of
active fibers and the potential generated between the electrodes.
In actual experimental situations, this relation is probably never
one of exact proportionality, although it may sometimes
approach proportionality rather closely.

THE REFRACTORY PERIOD OF THE 
AUDITORY NERVE 

The physiological process which sets an upper limit to the
frequency of impulses in each fiber is the refractory period
(cf. p. 301). For a brief interval of approximately 1 msec after
each impulse the nerve fiber is not excitable and cannot
transmit another impulse. One msec, corresponding to a maximal
frequency of 1000 impulses per second, may be taken as the
lowest value for the refractory period of an unfatigued fiber of
the auditory nerve in an intact cat, and probably in man as well.
Under most experimental conditions, however, the refractory
period appears slightly longer than 1 msec.

It is evident that, within about 1 msec, functional recovery
is sufficient for conduction in all parts of the pathway between
the hair-cell and the cochlear nucleus. Even the fine
non-medullated terminal twigs in the organ of Corti must recover
at least as rapidly as this. The refractory period of a tissue,
determined, as in the present case, by stimulation through
natural physiological channels, we term the functional
refractory period.

The functional refractory period of the auditory nerve is
not always constant. It increases significantly as stimulation
is continued, and the threshold of stimulation for each fiber
tends also to rise. In the higher nervous pathways, we find
that the functional refractory period is approximately 1 msec
throughout both the cochlear nucleus and the lateral lemniscus
up to the inferior colliculus (cf. Chapter 18). Above this
anatomical level it apparently becomes much longer, or else
differences in conduction time along different pathways become
very great, for no synchronized impulses at frequencies above
100 per second have been detected from the auditory cortex or
auditory radiations.

SCHEMA FOR THE AUDITORY NERVE 

Knowing the significant properties of the individual fibers
of the auditory nerve and the manner in which they are
innervated, we may construct a schematic representation of their
activity under stimulation by a pure tone. Such a representation
should account for the phenomena which we observe by means
of electrical recording: equilibration, synchronization,
alternation, and rotation.

Figure 156 (Derbyshire and Davis) illustrates graphically
how various factors combine to cause a high degree of
equilibration near the critical frequency. In this diagram one fiber is
assumed to have a functional refractory period less than 1.1
msec, to continue to respond with one impulse to each sound
wave, and to show a moderate reduction in the size of each
impulse. The next two fibers alternate when their functional
refractory periods become prolonged beyond 1.1 msec. The
fourth fiber is assumed to cease responding entirely after a short
time. The net result is a very considerable shrinkage in the
composite action-potential.
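
The four-fiber schema just described can be imitated by a small
simulation in Python, offered only as a sketch under stated
assumptions. The names responding_waves and composite, the
particular refractory periods (1.0 and 1.2 msec), and the
supposition that the two alternating fibers begin on different
waves are illustrative choices. Every impulse is counted as one
unit, so the moderate shrinkage of the individual impulses is
ignored. The tone is taken at 900 cycles, whose period is about
1.1 msec.

```python
def responding_waves(n_waves, period_ms, refractory_ms, first_wave=0):
    """Indices of the sound waves to which one fiber responds.

    The fiber answers a wave only if at least `refractory_ms`
    has elapsed since its previous impulse.
    """
    fired = []
    last_time = None
    for k in range(first_wave, n_waves):
        t = k * period_ms
        if last_time is None or t - last_time >= refractory_ms:
            fired.append(k)
            last_time = t
    return fired


def composite(n_waves, period_ms, fibers):
    """Number of fibers discharging on each successive wave.

    `fibers` is a list of (refractory_ms, first_wave) pairs.
    """
    counts = [0] * n_waves
    for refractory_ms, first_wave in fibers:
        for k in responding_waves(n_waves, period_ms,
                                  refractory_ms, first_wave):
            counts[k] += 1
    return counts


if __name__ == "__main__":
    period = 1000.0 / 900.0  # msec between waves of a 900-cycle tone

    # At the start all four fibers are assumed to keep pace.
    initial = [(1.0, 0)] * 4
    # Later: one fiber keeps pace, two alternate (assumed to start
    # on different waves), and the fourth has ceased responding.
    later = [(1.0, 0), (1.2, 0), (1.2, 1)]

    print("initial:", composite(8, period, initial))
    print("later:  ", composite(8, period, later))
```

In this idealization the composite response falls from four
units to two on every wave, yet no wave goes unanswered, so the
frequency of the tone is still reproduced.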

Fig. 156. Schema of the response of four typical fibers of the
auditory nerve to a 900-cycle stimulus, showing how the development
of alternation leads to fast equilibration and how the frequency of
the stimulating sound may be reproduced by the action-potentials
even when no fiber can respond to every sound wave. The
action-potential patterns of the four fibers are shown for the
cycles after the initial wave and again five seconds after the
start of stimulation, when fiber 4 is not stimulated above
threshold; the composite is shown below. The solid vertical lines
represent the action-potentials of individual fibers, which
diminish slightly in amplitude with repetitive stimulation. The
horizontal lines represent the duration of the functional
refractory periods, which become slightly prolonged during
repetitive activity. The heavy vertical lines show the composite
effect of the individual action-potentials. The shortening of
these lines represents a diminution in amplitude of the total
action-potential, and the thickening represents a slight temporal
dispersion of the individual impulses. (Derbyshire and Davis, 2.)

This diagram demonstrates how the frequency of the
stimulating tone is reproduced in the auditory nerve, even when
the frequency is so high that no single fiber can respond to every
sound wave. The principle of alternation and rotation allows
for such reproduction up to the point at which temporal
dispersion of the impulses in different fibers obscures the initial
synchronization with the stimulating sound waves. The
principle of rotation, essentially in its present form, was proposed as
a theoretical possibility by Troland, prior to its experimental
discovery by Wever and Bray (3). Troland employed the
principle as a basis for his theory that “pitch is determined by
the frequency of a series of regularly spaced impulses, carried
in a group of cooperating fibers.” Wever and Bray called the
principle the volley theory, and used it in developing a
hypothesis of pitch perception similar to Troland’s. The term ‘volley
theory,’ however, used to designate a rotational activity set up
by a rapidly intermittent stimulus, is not entirely appropriate.
‘Volley’ implies that all units fire at once. In fact, for many
years, this word has been used by Sherrington to designate a
group of impulses initiated simultaneously in all the fibers of a
nerve. The impulses in the auditory nerve might be thought
of as a series of incomplete volleys, but a more apt military
analogy to the rotational response by different groups of fibers
would be ‘platoon fire.’

THE THRESHOLD OF NEURAL ACTIVITY UNDER 
TONAL STIMULATION 

It has already been pointed out that the threshold of
nerve-impulses for impulsive stimuli is very close to that for detection
of the aural microphonics, and also to the threshold of hearing.
The same generalizations apply to the neural response to tonal
stimulation, provided the electrodes are properly placed in the
auditory nerve. The correspondence is better for low tones
than it is for the high tones which cause alternate or
asynchronous response.

The threshold for a given tone is lower with the electrodes
in one part of the auditory nerve than in another, and the
position of the electrodes, in the nerve, which gives maximal
sensitivity to a high tone is not the same as for maximal sensitivity
to a low tone. This fact provides strong support for the place
theory of the reception of tones of different frequency, but it
has not proved possible to map the auditory nerve systematically
with respect to the specificity of its fibers for different tones.
This failure is presumably due to the complicated arrangement
within the nerve of the fibers from different portions of the
basilar membrane (see p. 377). In any event, it is clear that the
threshold values are so much a function of the exact position
of the electrodes that it is misleading to attempt the construction
of a complete threshold-curve from any single experiment.
Nevertheless, it is possible to state that the lowest thresholds
found with the best placements of the electrodes correspond,
within a few decibels, to the thresholds of the corresponding
aural microphonics.

RELATION TO INTENSITY OF TONAL 
STIMULATION 

The voltage of the action-potentials recorded from the
auditory nerve increases as the intensity of the stimulating tone is
increased. Measurements of this voltage as a function of
intensity are complicated by the phenomena of alternation and
equilibration, but, at frequencies below 500 cycles, the curve may
correspond quite closely to the curve for the growth of the
aural microphonics (Fig. 157). At other frequencies, such as
1000 cycles, the curve may depart rather widely from that of
the microphonics. One reason for this is that near the critical
frequencies alternation sometimes occurs with weak
stimulation but not with strong. At frequencies above 3000 cycles,
where the nerve response appears essentially asynchronous, no
significant measurements of neural activity have been made.
As the intensity is increased, however, there is an obvious
increase in the number of nerve-impulses passing up the nerve
per unit of time.

Fig. 157. The initial voltage of the action-potentials of the
auditory nerve (broken line) and the voltage of the corresponding
cochlear microphonics (solid line) as functions of the intensity of
a 500-cycle tone. The voltage-scales are arbitrary and have been
adjusted so that the two curves coincide at their maxima. The two
amplitude functions run a very similar course in this case, but in
many experiments they diverge significantly. (Derbyshire and
Davis, 2.)

The maximal amplitude of the action-potential, and the
intensity of stimulation at which the maximum is reached, both
depend rather definitely upon the exact position of the coaxial
recording electrodes. Electrodes of this type are not equally
affected by all the fibers in the nerve. Their sensitivity is
limited essentially to those fibers in their immediate
neighborhood. Therefore, we cannot assume that the recorded potential
is a function only of the number of synchronously active fibers
in the nerve, for it is also a function of their position relative to
the electrodes. Other types of electrodes may be less limited
in their ability to detect the activity of distant fibers, but,
unfortunately, it is only with coaxial electrodes that it has been
possible to eliminate the aural microphonics, and, if the
microphonics are not excluded, measurement of the action-potentials
becomes quite impossible.

We should like, of course, to be able to measure the number
of active nerve fibers as a function of intensity of stimulation.
The nerve-impulses are all-or-none in character, and at low
frequencies their frequency is determined by that of the sound
waves. Therefore, increase in the voltage of the action-potential
must depend upon increase in the number of active fibers, and,
as long as the impulses are synchronized with a stimulating
sound wave, it is only by increase in the number of active fibers
that there can be an increase of activity in the auditory nerve.
At present we can say that, under the most favorable
experimental conditions, the number seems to be approximately
proportional to the magnitude of the aural microphonics. It is
to be hoped that future experiments, with improved technique,
will allow a more precise definition of this relation. There is
probably no theoretical necessity for an exact proportionality.

The consideration that, when the frequency of the
nerve-impulses is determined by the frequency of the sound waves,
it is only by increase in number of active fibers that there can be
an increase of activity in the auditory nerve points to the
number of active fibers as the physiological basis for the attribute of
loudness. This statement implies that change in loudness is
associated with a corresponding change in the number of active
fibers. It does not necessarily imply, however, that loudness
is directly proportional to the number of active fibers.
Summation of the central effects of single fibers may not occur in a
simple arithmetical fashion, and, also, different fibers may form
different central connections, so that some fibers may thereby
contribute more than others to the total central nervous activity.
Therefore we await more extensive experimental data in order
to decide whether or not a simple numerical relation exists
between loudness and the number of active fibers in the auditory
nerve. (Cf. p. 151.)

THE SUMMATION OF ACTION POTENTIALS 

Since we find it expedient to regard the magnitude of the
action-potential as an experimental measure of the number of
active fibers in a nerve, it is appropriate that we inquire more
fully into the summation of the potentials from separate fibers.
When nerve-impulses pass simultaneously down two
neighboring fibers in any nerve, the combined action-potential is greater
than when only one fiber carries an impulse. The combined
action-potential is approximately equal to the sum of the
potentials recorded separately from impulses in the two fibers. It
might appear at first glance, since the two fibers lie parallel to
one another in the nerve, that the potential developed by
impulses in two similar fibers should be no greater than the
potential due to either one alone, as is true when two ordinary electric
batteries are connected in parallel. Experimentally the
potentials are additive, as if the batteries were connected in series.
Two considerations make this apparent contradiction of
anatomical fact seem reasonable. First, the potential of each
nerve fiber is shunted by all the surrounding tissue, including
neighboring nerve fibers. The more effective the shunting, the
less will be the potential recorded by a pair of electrodes placed
in contact with the nerve as a whole. When a second nerve
fiber is active it also develops a potential and no longer acts as
a mere passive electrical conductor shunting the potential of
the first nerve fiber. This successive removal of shunts as more
and more fibers become active is particularly important in a
small nerve dissected free of surrounding tissue and laid upon
a pair of electrodes.

The second consideration is more helpful for understanding
the situation in a nerve, like the auditory nerve, which is buried
in a large mass of surrounding tissue, so that the fraction of the
total shunting effect which is contributed by each individual
nerve fiber is very small. The nerve may be regarded as a
group of tiny batteries, each of high internal resistance and all
connected in parallel and shunted by the relatively low resistance
of the surrounding tissues. Activation of two fibers instead of
one reduces to one half the internal resistance of the source
and nearly doubles the current flow in the external circuit.
Therefore the potential detected by the recording electrodes
between two points in the external circuit is practically doubled.
The potentials of the individual fibers will add linearly, provided
the internal resistance of the fibers is very high as compared to
the resistance of the external shunting circuit. This is usually
true as a first approximation, for the electrical resistance of a
nerve fiber is of the order of 100 megohms per centimeter of
length (cf. Hill). Of course the resistance varies inversely as
the cross-section of the fiber, and it is a well established fact
that in a nerve containing fibers of varying diameters, like most
mixed sensory and motor nerves, the individual fibers contribute
to the total action-potential quite accurately in proportion to
their cross-sectional areas (Erlanger and Gasser). The relation
of magnitude of action-potential to size of fiber is exactly what
we should expect, whether we think in terms of the internal
resistance of the source or in terms of the shunting effect of one
fiber on the potential of another.
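
The battery analogy of the second consideration can be checked
with a few lines of Python. The sketch treats n identical active
fibers as n sources in parallel (equivalent to one source of the
same electromotive force and 1/n of the internal resistance)
feeding a common external shunt. The function name
recorded_potential and the numerical values (unit electromotive
force, 100 megohms for each fiber, 10,000 ohms for the shunt) are
assumptions chosen for illustration, and the shunting by inactive
fibers is neglected.

```python
def recorded_potential(n_active, emf=1.0, r_fiber=1.0e8, r_shunt=1.0e4):
    """Potential across the external shunt for n identical active fibers.

    n sources of e.m.f. `emf`, each of internal resistance `r_fiber`,
    joined in parallel behave as a single source of the same e.m.f.
    and internal resistance r_fiber / n. The shunt `r_shunt` then
    forms a simple voltage divider with that source resistance.
    """
    if n_active <= 0:
        return 0.0
    r_source = r_fiber / n_active
    return emf * r_shunt / (r_shunt + r_source)


if __name__ == "__main__":
    # High internal resistance: potentials add almost linearly.
    ratio_high = recorded_potential(2) / recorded_potential(1)
    # Internal resistance equal to the shunt: addition is far from linear.
    ratio_low = (recorded_potential(2, r_fiber=1.0e4)
                 / recorded_potential(1, r_fiber=1.0e4))
    print(f"two fibers vs one, high internal resistance: {ratio_high:.4f}")
    print(f"two fibers vs one, low internal resistance:  {ratio_low:.4f}")
```

The first ratio lies very close to 2 and the second well below
it, which is the sense of the proviso that the internal
resistance be very high compared with that of the external
shunting circuit.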

Although generation of the potentials in nerve fibers is
fundamentally different from the generation of piezo potentials in
the hair-cells (see Chapter 14), the same electrical laws must
govern the summation of the effects of individual elements in
both instances. In each instance there appears to be some
effective addition of the potentials of elements which anatomically
are arranged in parallel. In each instance the electrical circuits
are complex, and we cannot safely assume that the addition of
potentials will always be strictly linear. Furthermore, we must
always remember that cells or fibers remote from the recording
electrodes will contribute less to the observed potential than
similar cells or fibers close to the electrodes.

ACTION POTENTIALS IN RESPONSE TO
COMPLEX STIMULATION: MASKING

When the ear is stimulated simultaneously by a series of
clicks and a pure tone, the aural microphonics show a simple
summation of the electrical waves which would be produced
separately by each of the two stimuli. The only limitations to
precise summation are those imposed by nonlinear distortion at
high intensities. In sharp contrast to the microphonics,
however, the action-potentials, in response to clicks, decrease in size
when a masking tone or noise is added. The degree of
reduction of the action-potentials depends upon the relative intensities
of click and masking sound, and is greatest when the intensity
of the clicks is near threshold. Figure 147 (p. 382) shows the
effects of a masking noise on the response to clicks recorded
from the round window.

The action potentials evoked by a click may be partially
masked by tonal stimulation. Figure 158 shows that the degree
of masking produced by the tone depends upon the phase
relation of click to tone. The greatest reduction of the click
response results when the click occurs at the peak of the action
potential wave evoked by the tone. If the click response occurs
immediately before the tonal wave, the action potential of the
click is fully developed, while that of the tone is partially
masked. The effectiveness of a tone in masking a click is also
a function of its frequency and of its intensity.

The action potentials set up by tonal stimulation may be
very effectively masked by simultaneous stimulation with a
masking noise which has a wide frequency spectrum (Fig. 150).
Masking of the response of one pure tone by another can also be
demonstrated, although less dramatically. Quantitative studies of
the masking of tones, comparable to those carried out by
psychophysical methods (see Chapter 8), have not been attempted
because of the difficulties and uncertainties of measurement of
the action potentials.

Fig. 158. Oscillograms of the action-potentials of the auditory
nerve in response to combined stimulation by clicks and by a
500-cycle tone. Both click and tone are about 30 db above
threshold.
A. The first wave of the click response is almost completely
masked, but the second wave is unmasked.
B. The first wave is unmasked and one wave of the tonal response
is largely masked. The second wave of the click response is much
reduced.
C and D show intermediate degrees of masking. (Derbyshire and
Davis, 2.)

The reduction of the neural response to one sound by simultaneous
stimulation with another depends upon, and is a necessary
consequence of, the refractory period of nerve fibers. When one
sound wave has stimulated a nerve fiber, a second wave which
follows it within the functional refractory period is unable to
set up an impulse and is physiologically ineffective, as far as
that particular nerve fiber is concerned. Forbes has aptly
referred to this situation as the “line busy effect.” The degree
of masking of the response to one sound by another gives a method
for determining what proportion of nerve fibers the two responses
share in common, for it is obvious that this type of interference
can arise only when the two stimuli compete for the same nerve
fibers. The “line busy effect” probably underlies the
psychological masking of one tone by another, or by a noise,
although we need not assume that it is the only mechanism
involved.
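The link between the degree of masking and the proportion of shared
fibers can be put in a toy calculation. The sketch below assumes the
extreme case in which every fiber claimed by the masking sound is
refractory when the click arrives, so that the loss in the click
response equals the shared fraction of its fibers. The fiber indices
and the function name are hypothetical illustrations, not data.

```python
def masked_fraction(click_fibers, masker_fibers):
    """Fraction of the click's fibers that the masker also excites.

    Assumes (for illustration) that every shared fiber is still
    refractory at the moment of the click, the "line busy" case.
    """
    click = set(click_fibers)
    if not click:
        raise ValueError("the click must excite at least one fiber")
    shared = click & set(masker_fibers)
    return len(shared) / len(click)


# Hypothetical populations: fibers are labeled by arbitrary integers.
click_population = range(0, 100)     # fibers excited by the click alone
masker_population = range(60, 160)   # fibers excited by the masking tone

fraction = masked_fraction(click_population, masker_population)
print(fraction)  # share of the click response lost under this assumption
```

Read in reverse, an observed reduction of the click response gives an
estimate of how many fibers the two stimuli have in common, which is
the use suggested in the text.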

The different behavior of the aural microphonics and the
action potentials in response to complex stimulation is one of
the simplest and most direct demonstrations of their
fundamentally different characters. The microphonics summate; the
action potentials show masking. The test of masking by a
hissing sound readily identifies the neural components in a
mixed response such as that obtained from an electrode on the
round window.

INTERPRETATION OF THE PHASE-CHANGE
BEAT

We are now in a position to consider a series of experiments
which were designed to test the resonant properties of the inner
ear, and reveal the origin of the electrical activity of the
cochlea. If an observer listens to a steady tone whose phase is
abruptly shifted by 180°, he hears a discontinuity in the sound.
The discontinuity has been variously described as (1) a momentary
period of silence, termed the phase-change beat (Hartridge),
(2) a click superimposed on a steady tone, and (3) a momentary
increase of loudness following immediately after the change of
phase (Hartshorn). The momentary period of silence has
been interpreted as revealing the presence in the ear of a tuned
resonant structure which is less than critically damped. The
behavior of the ear is assumed to be analogous to that of a
vibration galvanometer driven by an alternating current at the
frequency to which the galvanometer is tuned. When the
phase of the current is abruptly shifted by 180°, the amplitude
of vibration of the galvanometer dies down to zero and then
builds up again in opposite phase.
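The behavior of the galvanometer analogy can be sketched numerically.
The example assumes a linear, lightly damped resonator driven exactly
at its resonant frequency, whose free vibration decays exponentially
with a time constant tau. Under that assumption the new steady state
is the old one reversed, and the old vibration dies away on top of
it, so the amplitude measured in the new phase runs from -A through
zero to +A. The function names and the value of tau are hypothetical.

```python
import math


def envelope_after_reversal(t, time_constant, steady_amplitude=1.0):
    """Signed amplitude, measured in the NEW phase, t seconds after reversal.

    Assumes a lightly damped linear resonator driven at resonance:
    A(t) = A * (1 - 2 * exp(-t / tau)).
    """
    return steady_amplitude * (1.0 - 2.0 * math.exp(-t / time_constant))


def silent_moment(time_constant):
    """Time at which the amplitude passes through zero (tau * ln 2)."""
    return time_constant * math.log(2.0)


TAU = 0.010  # seconds, hypothetical decay time constant
t_zero = silent_moment(TAU)

for t in (0.0, t_zero, 5.0 * TAU):
    print(round(t, 5), round(envelope_after_reversal(t, TAU), 4))
```

On this model the duration of the momentary silence scales with tau: a
sharply tuned structure shows a long null, while a heavily damped one
passes through it almost at once.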

When the cochlear microphonics and the nerve impulses
of the auditory pathways are recorded during an abrupt change
of phase of 180°, it appears, as shown in Fig. 159, that the
cochlear microphonics reproduce the change of phase with
considerable fidelity, whereas the action potentials do not. The
action potentials appear to die down momentarily and then
increase again in opposite phase; and they often show, in
addition, a large transient wave resembling the on-effect or the
response to a click.

The difference in the records of microphonics and action
potentials raises the question of whether the microphonics are
generated by the same vibrating structure that initiates the
nerve impulses. Hallpike, Hartridge, and Rawdon-Smith concluded
that the two electrical phenomena, microphonics and action
potentials, are not traceable to the same vibrating structure.
Despite the care and ingenuity of their argument, an
alternative explanation of the observed facts appears possible.

Fig. 159. Oscillographic records of sound waves, of cochlear
microphonics, and of action potentials from the midbrain. The
phase of the sound waves is shifted by 180° at the points
indicated by the arrows. The frequency is 1024 cycles. (For
further explanation see text.) (After Hallpike, Hartridge, and
Rawdon-Smith. By permission of The Royal Society of London.)

A change of phase of 180° is the equivalent of starting, in
opposite phase, a second tone of the same frequency and of double
the amplitude of the original tone. The irregularity of the
action potential record is probably due primarily to what is
equivalent to the on-effect of this second tone. The on-effect
stimulates nerve fibers over a wide area of the basilar membrane,
and the corresponding nerve impulses undoubtedly form
the basis of the click heard by many observers when the phase
of the tone is changed. It is impossible to decide from records
of the action potentials whether the diminution of the action
potential waves which follows the phase-change represents a
period of reduced neural activity, or whether it is actually a
period in which many impulses are initiated by the on-effect
but in which the nerve impulses are not synchronized with one
another. The response of the nerve fibers is further complicated
by the refractory period of the fibers. The new out-of-phase
sound waves probably find many of the fibers refractory,
first, because of stimulation by the last of the in-phase sound
waves, and, second, because of stimulation by the widespread
disturbance of the basilar membrane caused by the on-effect of
the second tone. In other words, the combination of refractory
period and on-effect may temporarily mask the new tone. The
cochlear microphonic, on the other hand, can reproduce the
phase-change with greater accuracy than is possible for the
action potentials, because it is not all-or-none in character, and
its continuity is not interrupted by a refractory period.
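The equivalence stated at the start of this paragraph is the identity
sin(wt + 180°) = sin(wt) + 2 sin(wt + 180°): the original tone may be
regarded as continuing, with a second tone of double amplitude and
opposite phase added to it. A brief numerical check is sketched
below. The 1024-cycle frequency is that of Fig. 159; the sampling
instants and function names are hypothetical choices made only for
the check.

```python
import math


def reversed_tone(t, freq, amp=1.0):
    """The tone after its phase has been shifted by 180 degrees."""
    return amp * math.sin(2.0 * math.pi * freq * t + math.pi)


def original_plus_second_tone(t, freq, amp=1.0):
    """Original tone continuing, plus a double-amplitude tone in opposite phase."""
    original = amp * math.sin(2.0 * math.pi * freq * t)
    second = 2.0 * amp * math.sin(2.0 * math.pi * freq * t + math.pi)
    return original + second


FREQ = 1024.0                                    # cycles per second
times = [n / 48000.0 for n in range(200)]        # hypothetical sample instants

max_diff = max(
    abs(reversed_tone(t, FREQ) - original_plus_second_tone(t, FREQ))
    for t in times
)
print(max_diff)  # differs from zero only by floating-point rounding
```

Because the added tone begins abruptly, it is natural to expect an
on-effect from it, which is the interpretation the text goes on to
develop.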

The most satisfactory explanation of all of the effects of an
abrupt change of phase is to assume that the basilar membrane
is sufficiently damped to allow it to follow such a phase shift
with high fidelity, as revealed by the cochlear microphonics,
and that the reduction of amplitude of the action potentials is
due to asynchronous response to the on-effect, to the refractory
period of the nerve fibers, and to the resulting masking of nerve
impulses. If these assumptions are justified, it is not necessary
to abandon the theory, which accords so well with so many other
experimental observations, that the cochlear microphonics arise
as a piezoelectric effect in the hair-cells of the organ of Corti
and that the impulses in the fibers of the auditory nerve are
initiated by a chemical process originating in these same
hair-cells.

The ability of the cochlear microphonics to follow the phase
shift, without showing evidence of dying out to zero and
starting up again in the new phase, means that the microphonics are
generated by a mechanism which is very nearly critically
damped. That the basilar membrane should be so highly
damped argues strongly against the resonance theory of hearing
(see p. 360) and suggests a place theory based on the principle
presented in Appendix II.



CHAPTER 16 


NERVE IMPULSES IN THE HIGHER 
AUDITORY PATHWAYS 

In our inquiry into the physiology of hearing, we have studied
the mechanics of the ear, the generation of the aural
microphonics, and, finally, the impulses in the auditory nerve. At
each stage of the process we have sought to relate the activity
of the auditory mechanism to the dimensions of the stimulating
sound. Our ultimate objective, of course, is to determine how
the characteristics of our auditory experiences are imposed by
the nature of the peripheral mechanism of sense organ and
nerve. We want to know the relation between the mechanical
activity of the cochlea, the impulses in the auditory nerve, and
the various subjective attributes of sound. But the problem of
psychophysiological correlation does not end at the periphery.
We face the additional problem of relating our sensory
discriminations to the objective phenomena observable in the central
nervous system.

THE GROSS ANATOMY OF THE AUDITORY 
PATHWAYS 

From the point of view of psychophysiology, it is not necessary
to describe in great detail the anatomy of the various
nervous pathways within the central nervous system. The
justification for this apparent neglect of facts which may be
found in various anatomical works is not that anatomy is
unimportant, but rather that our knowledge of the physiological
activity of the various tracts is so slight that most of the
anatomical detail stands unrelated to other data and does not
appreciably illuminate the problem of psychophysiology which is our
central concern. Nevertheless, purely anatomical considerations
may set sharp limits to physiological speculation. For example,
it has been suggested from time to time that the saccule may
play some part in audition. Anatomical studies (Lorente de
No, 2) make this hypothesis untenable, since the nerve fibers
from the saccular macula connect with the centers and pathways
involved in the regulation of equilibrium but make no
connection with those involved in the function of hearing.*

The essential features of the anatomy of the pathways between
the auditory nerve and the cerebral cortex are presented in
the form of a simplified schematic diagram in Fig. 160. The cell
bodies of the primary neurons constituting the auditory nerve
are located within the modiolus of the cochlea. The primary
afferent neurons all terminate in the cochlear nucleus, a mass
of ‘gray matter’ located in the dorsal and lateral portion of the
medulla oblongata at the point of entry of the auditory nerve.
Here all the primary neurons form synaptic connections with
second-order neurons. The diagram illustrates the pathways of
these neurons to the inferior colliculus and to the medial
geniculate body. In the diagram of Fig. 160 no single neuron is
represented as extending the entire distance from the cochlear
nucleus to the medial geniculate body. Certainly most, if not
all, of these pathways are interrupted by a synapse somewhere
along the way. We shall see that physiological evidence
supports this view. In man, most of the fibers leaving the cochlear
nucleus cross the midline and proceed forward, in a tract
known as the lateral lemniscus, to the thalamic nuclei on the
opposite side. There are in addition, however, a small number
of homolateral connections between the cochlear nucleus and
the higher centers. The third-order neurons of the auditory
pathways all converge in the medial geniculate body, which is
the final relay station on the auditory path to the cerebral
cortex. The medial geniculate body also receives fiber tracts from
other sensory systems, as well as from the cerebral cortex, and it
therefore appears to be not only a relay station but also an
integrating and coordinating mechanism.

* A tiny bundle of nerve fibers connecting the saccular macula
directly with the spiral ganglion in the cochlea has, however, been
described in man (Hardy). Cf. also the work of Ashcroft and
Hallpike on the function of the saccule in the frog.

Fourth-order neurons connect the medial geniculate body
with the cerebral cortex by way of the auditory radiations. The
best accounts of the connections and arrangement of these
neurons (Poljak, Walker) are based upon experiments on
monkeys (Macacus), but the description is adequate as a first
approximation for the human brain as well. The auditory
radiations resemble a fan, with its handle at the medial
geniculate body, and pass into the white matter of the superior
temporal convolutions of the cortex as a thin sheet of nerve-fibers.
The radiations also include corticofugal fibers running from the
cortex back to the medial geniculate body. Only a few of the
afferent fibers of the auditory radiations reach the convex face
of the temporal lobe. Most of the fibers enter a small region in
the posterior half of the horizontal wall of the Sylvian fissure
(Fig. 161), which represents what Poljak terms “the nuclear
or focal zone” of the entire auditory cortex. Apparently all
the auditory impulses which reach the cerebral cortex must
pass through the focal zone and thereafter be distributed to
surrounding areas. Other afferent fibers from the midbrain
which do not belong to the auditory radiations are distributed
along the entire Sylvian fissure. None of the surrounding
temporal cortex appears to receive any fibers from the subcortical
nuclei. Walker describes the focal zone very specifically as
an area 6 to 8 mm in length and 4 mm in width, sharply
bounded, and possessing distinctive features in the character
and arrangements of the cells of which it is composed. It lies
just mesial to Brodmann’s area 22.

Fig. 160. ... indicates successive transverse sections through
various levels of the brain-stem, except for the upper part of the
upper level. This upper level represents a vertical section through
the cerebral hemispheres, so placed as to intersect the
cross-sectional level of the upper midbrain in the region of the
medial geniculate body.
Only the neurons of one auditory nerve and their main connections
are represented, although the auditory nervous system is actually
bilaterally symmetrical. Only one pathway of each type has been
represented and no attempt has been made to indicate the relative
numbers of the different types of neurons illustrated. (After
Rasmussen.)

Fig. 161. Diagram of the auditory projection area in the monkey
(Macacus).
Ft: Sylvian fissure.
a: dark shading, the area receiving fibers of the auditory
radiations from the medial geniculate body. This area extends
deeply into the Sylvian fissure.
x: light shading, the area receiving other afferent fibers which
apparently do not belong to the auditory radiation. (Poljak, 2.)

The projection from the medial geniculate body upon the
cortex shows a precise point-to-point relation. The arrangement
of the cells in the medial geniculate body and of the fibers
in the auditory radiations is extremely regular and orderly.
Figure 162 shows the relation, demonstrated by Walker, between
portions of the cortical auditory area and the medial
geniculate body. The precision of the relations suggests the
possibility of a fixed and stable projection of the organ of Corti
upon the cerebral cortex.

Fig. 162. Schema of the projection of the medial geniculate body
(below) upon the cortical auditory area (above). Each of the
variously shaded portions of the geniculate body projects to the
similarly marked portion of the cerebral cortex.
Ant: anterior. L: lateral. M: medial. Post: posterior.
SS: Sulcus sylvii. (Walker, 2.)

THE MICROSCOPIC ANATOMY OF THE 
COCHLEAR NUCLEUS 

The finer microscopic structure of the cochlear nucleus
deserves a brief description, if only to counteract the impression of
simplicity unavoidably conveyed by such a diagram as Fig. 160.
The bewildering complexity of the nucleus must be thought of
as typical of the cerebral cortex and of the gray matter of the
central nervous system in general. The arrangement of the
auditory fibers as they enter the cochlear nucleus and distribute
to the various portions of this structure has already been pictured
and described in Fig. 146 (p. 378). A closer inspection of the
elements composing the first central station of the cochlear fibers
shows that these elements are much more complicated structures
than is generally assumed. Each cochlear fiber makes connections
with numerous cells. These cells are grouped in no less
than thirteen regions, in which different types of neurons,
arranged in characteristic fashion, are found (Lorente de No, 2).
Although numerical data are not available, it is probably safe
to state that in the primary cochlear nuclei no less than forty
or fifty types of neurons are present, and that each cochlear fiber
establishes connections with many hundreds, and perhaps even
thousands, of cells. The cells may be divided into classes. For
example, there are cells with long and cells with short axons.
The former cells convey impulses which are carried from the
cochlear nucleus to higher centers, and may be called efferent
cells, but the short axons of the second type of cell do not extend
beyond the boundaries of the cochlear nucleus itself. They
make interconnections within this nucleus, and finally impinge
upon cells with long axons. The cells with short axons which
remain entirely within the boundary of the nucleus have
frequently been called intercalary or internuncial neurons. These
names, however, are to some extent misleading because the
cells with short axons are not arranged ‘in series’ between the
primary afferent fibers and the efferent neurons, but are
arranged in the form of alternative routes ‘in parallel’ with the
direct synapse between the first-order and the second-order
neurons. The physiological significance of this type of double
connection between the primary afferent fibers and the
higher-order neurons is, as yet, quite obscure, but it appears to be the
type of connection which is characteristic of all parts of the
gray matter of the central nervous system. When we speak of
first-order, second-order, and third-order neurons, we have in
mind the simple concept of the diagram in Fig. 160, and we
do not take account of the short accessory neurons within the
nuclei. The designation, second-order neuron, means that an
impulse coming from the periphery must have traversed at least
one previous neuron and synapse before arriving at this, the
second-order neuron. Third-order implies that at least two
previous neurons and synapses have been traversed, and so on.
We shall see that experimental observations indicate that
transmission may be, and often is, as direct as this. The short
accessory neurons, however, give almost limitless opportunities
for interconnection and interplay between the various fibers of
the auditory pathways.

NERVE IMPULSES IN THE COCHLEAR NUCLEUS 

No very systematic study of the nerve impulses in the
cochlear nucleus has been carried out. An electrode placed on the
surface of the cochlear nucleus detects activity much like that
in the auditory nerve. This activity usually includes an
admixture of aural microphonics conducted along the auditory nerve
and the meningeal covering of nerve and brain. Coaxial
electrodes inserted into the substance of the nucleus detect
large and complex responses to sounds. The character of the
responses depends upon the exact position of the electrodes,
but the responses are often of higher voltage than those obtained
from the auditory nerve. It is generally true that the voltages
obtained from gray matter, which contains the bodies of nerve
cells and the synaptic junctions, are larger than those obtained
from white matter, which consists of axons alone.

NERVE IMPULSES IN SECOND-ORDER AND
THIRD-ORDER NEURONS

It is unnecessary to tabulate all the various anatomical
structures of the midbrain from which auditory impulses have
been detected. The list includes all the structures named in
Fig. 160. Furthermore, practically all the structures from
which auditory activity has been reported belong to
anatomically recognized auditory pathways.





The most complete investigation of the nerve impulses in
auditory pathways has been made in the cat by Kemp, Coppée,
and Robinson. The following account is based almost entirely
on their work.

The trapezoid body is a convenient structure in which to
observe the impulses in second-order auditory fibers. Its
response to stimulation of the ear by low tones is a series of volleys
of action potentials synchronized to the frequency of the sound
waves. The electrical pattern closely resembles that of the
first-order neurons of the auditory nerve. The threshold is as low
as, and frequently 10 to 20 db lower than, that for the aural
microphonics simultaneously recorded from the round window.

Upper Limit of Synchronization. The upper limit of frequency
for synchronization is about 2500 cycles, and this limit
is attained only under strong stimulation. Apparently the
impulses can maintain synchronization through one, and
occasionally two, stages of alternation. Thus, the limit is lower than
it is in the auditory nerve. This lower limit may be due to a
tendency for the impulses to become more dispersed in time as
they traverse synapses, or to a tendency of second-order neurons
to respond irregularly when bombarded by impulses at a
frequency too high for the neuron to follow. We may note
here that the upper limit of synchronization in the third-order
neurons of the inferior colliculus may be as high as 1500 cycles,
but more commonly it is near 1000 cycles. It therefore appears
that distinct alternation no longer readily occurs when two
synapses have been traversed, or else that the alternation becomes
completely obscured by temporal dispersion. This failure of
synchronization at frequencies above about 1000 cycles is
important evidence against the adequacy of a frequency theory
of pitch perception (p. 359). It may also be significant that
binaural beats may be heard only when the generating tones
are not higher than 800 cycles (p. 172).

THE AMPLITUDE OF ELECTRIC RESPONSE 
The amplitude of the electric response at a given point in
the trapezoid body increases with increasing intensity of the
stimulating tone, but, for a given position of the coaxial
electrodes, the increase may be quite irregular (Fig. 163). The
same kind of irregularity is found also in the response of
third-order neurons, and different electrode positions give different
curves for one and the same tone. The irregularities are
probably due to the fact that the coaxial electrodes detect highly
localized activity at specific positions in an anatomically
complicated nerve tract. The situation is similar to that found in
the auditory nerve, but the effect of slight differences in the
position of the electrodes is even more critical in the trapezoid
body. There underlies this irregularity, however, the important
implication that the impulses generated by a given tone at a
given region of the basilar membrane are assigned for
conduction to certain well-defined groups of fibers.

Fig. 163. The amplitudes of action potentials from a point in the
lateral lemniscus as a function of the intensity of the
stimulating tones. Intensity is expressed in decibels above an
arbitrary threshold. The amplitudes increase as additional fibers
are activated, but the increase is often irregular because the
pickup of the coaxial electrodes is confined to a small area
selected at random in a large complex tract. (After Kemp, Coppée,
and Robinson.)

The Response of Second and Third-Order Neurons to Impulsive
Stimuli. To impulsive stimuli, such as clicks, the
response of second and third-order neurons is complex, like
the response of the first-order neurons of the auditory nerve.
The F, G, and H waves described in Chapter 16 can usually be
recognized in the higher pathways. They show here the same
relative latencies and are masked by the same tones as in the
peripheral nerve. In the higher centers, however, the shape
of the action-potential wave as a whole is somewhat more
variable as a function of the position of the electrodes. Figure
164 shows how, with a given placement, the amplitude of the
F wave increased over a range of 30 to 40 db and then attained
a maximum. At a nearby electrode-position the threshold was
30 db higher than at the first, but the amplitude increased
over an intensity range of 30 db above its own threshold. Over
this range the response at the first position remained constant.
It is evident that the total response in the nerve tract as a whole
increased over the entire range, and that the limit reached at
the first electrode-position was fictitious, in the sense that it
resulted from the localizing power of the coaxial type of
electrodes.

Fig. 164. Amplitude of action potentials of the lateral lemniscus
(third-order neurons), as a function of increasing intensity of a
click. The apparent maximum obtained at position A is probably due
to localized recording and is no indication that additional fibers
are not activated at higher intensities, for from position B in
the same preparation a response is obtained which has a
considerably higher threshold and which continues to grow in
amplitude over a correspondingly higher range of intensity. (After
Kemp, Coppée, and Robinson.)
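The argument about a fictitious local maximum can be made concrete
with a toy model. The sketch assumes, purely for illustration, that
the response picked up at each electrode position grows linearly with
level above its own threshold and then levels off. The thresholds (0
and 30 db) and growth ranges (30 db) follow the figures quoted above;
the linear shape, the unit maximum, and all names are assumptions.

```python
def local_amplitude(level_db, threshold_db, growth_range_db, maximum=1.0):
    """Response at one electrode position (assumed piecewise linear)."""
    if level_db <= threshold_db:
        return 0.0
    fraction = min((level_db - threshold_db) / growth_range_db, 1.0)
    return maximum * fraction


def position_a(level_db):
    # First position: grows for 30 db above an arbitrary zero, then levels off.
    return local_amplitude(level_db, 0.0, 30.0)


def position_b(level_db):
    # Nearby position: threshold 30 db higher, grows over a further 30 db.
    return local_amplitude(level_db, 30.0, 30.0)


levels = list(range(0, 61, 10))
totals = [position_a(level) + position_b(level) for level in levels]

for level, total in zip(levels, totals):
    print(level, round(position_a(level), 2), round(position_b(level), 2), round(total, 2))
```

Position A reaches its ceiling at 30 db while the sum of the two
positions keeps rising up to 60 db, which is the sense in which the
limit seen at a single electrode position says nothing about the
tract as a whole.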

THE LATENCY OF NERVE IMPULSES IN THE 
HIGHER PATHWAYS 

Figure 165 shows the latencies of nerve impulses in
first-order, second-order, and third-order neurons in relation to one
another, and also in relation to the strength of the stimulus.
The measurements were made on the earliest (F) wave in
response to clicks of various intensities. For strong stimuli, the
latency of this wave in the auditory nerve is 0.9 msec with
respect to the beginning of the cochlear microphonic. In the
second-order neuron it is 2.2 msec, at the same intensity of
stimulation. The difference of 1.3 msec must be attributed to
(a) conduction time and (b) synaptic delay in the cochlear
nucleus. A reasonable maximal allowance for conduction time
is 0.5 msec. We therefore arrive at a value for the synaptic
delay of about 0.8 msec.
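The arithmetic of this estimate is simple enough to set down as a
short sketch. The latencies and the conduction allowance are the
values quoted in the paragraph above; the function name is a
hypothetical label. Because the conduction allowance is described as a
maximal one, the result is best read as a lower bound on the synaptic
delay.

```python
def synaptic_delay_ms(latency_before_ms, latency_after_ms, conduction_allowance_ms):
    """Delay left over for the synapse once conduction time is subtracted."""
    latency_difference = latency_after_ms - latency_before_ms
    return latency_difference - conduction_allowance_ms


# Values quoted in the text for strong click stimuli.
FIRST_ORDER_LATENCY_MS = 0.9    # auditory nerve
SECOND_ORDER_LATENCY_MS = 2.2   # second-order neuron
MAX_CONDUCTION_MS = 0.5         # "reasonable maximal allowance"

delay = synaptic_delay_ms(
    FIRST_ORDER_LATENCY_MS, SECOND_ORDER_LATENCY_MS, MAX_CONDUCTION_MS
)
print(round(delay, 1))
```

A smaller conduction allowance would leave a correspondingly larger
share of the 1.3 msec difference to the synapse.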

The latency between second-order and third-order neurons
lies between 1.3 and 1.7 msec. When we again make allowance
for conduction time we find that the synaptic delay in the
olivary complex, or in the nucleus of the lateral lemniscus, must
also be at least 0.8 msec. This value, in fact, appears to be a
good average value for the shortest synaptic delays which we
encounter, and is nearly the same as the 0.6 msec given by
Lorente de No (3) for the minimal delay in the oculomotor
nuclei. Also the delay between the hair-cells and the primary
auditory neurons is usually at least 0.6 msec (p. 385).

It will be seen that the curves in Fig. 165 fall into three
groups, separated in latency by 1.2 msec or more, corresponding
to the different orders of neurons. The systematic and
step-like increase in latency gives a basis for deciding, in doubtful
cases, whether a response from a particular region represents
the activity of a neuron of the first, the second, or the third
order, for a minimal delay of 0.6 to 0.8 msec appears to be a
general feature of synaptic transmission. On this basis it
appears that all pathways which reach the level of the midbrain
have been interrupted by synapses at least twice (once in the
cochlear nucleus and again in the olivary complex or the nucleus
of the lateral lemniscus), for no impulses have been found in
the medial geniculate body with latencies brief enough to
belong to second-order neurons.

Fig. 165. A comparison of the latencies of action potentials at
five positions in the auditory pathways in a cat. The auditory
nerve-responses were recorded from the round window. The
second-order neurons were fibers of the trapezoid body and the
third-order neurons were fibers of the lateral lemniscus. Note
that the latent period decreases with increasing intensity of the
stimulating click until it reaches a final minimal value (cf. Fig.
149) and that the change in latency is greater in the second and
third-order neurons than in the fibers of the auditory nerve. The
threshold in this figure is arbitrarily chosen as an intensity
level slightly lower than that which elicited responses from the
trapezoid fibers. (After Kemp, Coppée, and Robinson.)

From fourth-order neurons, in the brachium and medial
geniculate body, responses to clicks may sometimes be obtained.
Their latencies, although not shown in Fig. 165, are 4 to 5 msec.
From the auditory cortex, under favorable conditions of
anesthesia, similar responses appear with a minimal latency of
about 8 msec.

Returning to Fig. 165, we observe that the curves for all
orders of neurons show a diminution of latency with increase
of stimulus intensity. The decrease is greatest for sounds near
threshold. In the primary neurons of the auditory nerve the
decrease is from 1.3 msec to 0.9 msec, that is to say, 0.4 msec.
The decrease amounts to 0.7 msec in the second-order and
third-order neurons. Evidently there is a significant shortening of
the delay in the cochlear nucleus as a result of increasing the
number of active fibers. A similar diminution of synaptic delay
regularly occurs at the junction between second-order and
third-order fibers, as the downward sweep of the upper curve clearly
shows. By the time third or fourth-order neurons are reached,
the cumulative effect of the shortening of all the synaptic delays
is considerable.

The fourth-order neurons connecting the inferior colliculus
to the medial geniculate body and the neurons of the auditory
radiations are difficult to study, for the higher synapses of the
auditory pathways are more or less completely blocked by the
surgical anesthesia necessary for such experiments. In some
cases impulses have been detected, latencies measured, and the
limit of synchronization determined, but our knowledge
concerning these neurons is quite meager. The upper limit of
synchronization in the auditory radiations appears to be far
below 500 cycles, and may be as low as 100 cycles, which is the
highest frequency of response yet recorded from the auditory
cortex. Single widely separated clicks may yield large
well-synchronized responses from these higher-order neurons, but
fatigue or equilibration appears early, and at low frequencies,
and the response to a steady tone consists chiefly of an on-effect,
and perhaps also an off-effect, with comparatively slight, usually
asynchronous, activity while the tone continues. In this respect
the activity of the auditory radiations resembles that of the
cortex more closely than it does that of the third-order neurons.
Loss of precise synchronization of impulses and great
vulnerability to anesthetics seem to characterize the synapse between
third and fourth-order neurons, and also the synapses of the
cortex. The synapses of the olivary complex, on the other hand,
are about as resistant to anesthetics as are those of the
respiratory center, and those of the cochlear nuclei are even more
resistant.

RELATION OF LATENCY TO AUDITORY 
LOCALIZATION 

The earlier arrival of larger volleys of impulses at the centers
of the midbrain (Fig. 165) is interesting in connection with the
phenomena of the apparent direction of the source of a sound.
In Chapter 6 we found that the earlier arrival of a click at one
ear causes us to refer the source of the sound to that side. A
similar effect is produced by making the sound louder in one
ear, and a delay in arrival at one ear can be offset, to some
extent, by increasing the intensity of the later sound
(Trimble 4).

The relations presented in Fig. 165 suggest that a difference
in intensity may, in the higher pathways, be physiologically
equivalent to a difference in time. Such a reduction of
intensity to time would simplify the theory of localization, but the
experimental data on localization do not justify a complete
reduction of this sort. Furthermore, measurement of the
smallest increase in intensity of a tone in one ear necessary to shift
the apparent source of a tone to the side of that ear (Fig. 72,
p. 170) shows that the necessary increase is less than 1 db at
medium intensities and that it increases at lower levels. The
curves of Fig. 165 suggest, however, that the necessary increase
of intensity should be smaller at low levels, instead of larger.
In other words, no simple relation between time and intensity,
as factors in localization, can be derived from our present data.





THE EFFECTS OF BINAURAL STIMULATION 

The cochlear nucleus gives little or no electric response to
sound waves delivered to the opposite ear, provided bone
conduction and other forms of direct spread of the stimulus to
the homolateral cochlea are scrupulously avoided. Likewise,
in the trapezoid bodies, the activity is determined by the
stimulation of the ear from which the tract originates, but, by the
time the superior olivary complex and lateral lemniscus have
been reached, there is a complete mingling of homolateral and
contralateral fibers from the two cochlear nuclei. In the cat,
there seems to be approximately the same number of crossed
and of uncrossed fibers in the lateral lemniscus. At any given
level the homolateral and contralateral neurons are of the same
order, that is to say, homolateral second-order neurons mingle
with contralateral second-order neurons, third-order with
third-order, and so on. At any given point where homolateral and
contralateral fibers are mingled, the latency of response to
stimulation of the right ear is equal to that following
stimulation of the left ear. In other words, impulses originating
simultaneously in the right and left ears pass simultaneously
up both the right and left lateral lemniscus. In the right
lateral lemniscus the impulse originating in the right ear does
not lead in time the corresponding impulse from the left ear
(Kemp and Robinson).

The impulses from the two ears run in separate fibers of
the lateral lemniscus quite independently of one another. If
the two ears are stimulated simultaneously, the latency of
response in the lateral lemniscus is the same as that following
stimulation by the same intensity delivered to either ear alone.
Although doubling the intensity of stimulation of one ear may
appreciably reduce the latency of the response (see Fig. 165),
the stimulation of the opposite ear by the same
intensity does not cause any change in latency. Nevertheless,
the amplitude of the total response at the lateral lemniscus is
increased, and, at least at low intensities, the total response may
quite accurately equal the sum of the homolateral and
contralateral responses taken separately. Presentation of a tone to
one ear and a click to the other does not result in masking,
and there is no evidence of masking of one click by another
delivered slightly earlier to the opposite ear. In the language
of neurophysiology, we can say that there is little if any
facilitation, occlusion, or interaction of any sort between the
homolateral and contralateral fibers of the auditory pathways at the
levels of the cochlear nuclei or of the superior olivary complex.
The pathways are anatomically mingled, but physiologically
independent, at least up to and including the third-order
neurons. Whether or not binaural interaction occurs in the inferior
colliculus or the medial geniculate body has not yet been
determined.

EFFECTS OF OPERATIVE DAMAGE TO THE 
AUDITORY PATHWAYS 

A totally different experimental approach to the problem
of the interaction between, and the relative importance of,
bilaterally symmetrical portions of the auditory system is the
surgical removal of one or more parts of the auditory pathways.
Such operations have been performed (Brogden, Girden,
Mettler, and Culler) upon dogs that had previously been trained
by the method of conditioned reflexes to respond to the faintest
audible tone. Their auditory acuity was determined prior to
operation, immediately following it, and also a considerable
period later. This type of animal experimentation most nearly
resembles psychological measurements made upon human
beings, and gives a fair indication of the effect of any
particular operation upon the auditory acuity of the animal.

Removal of the cerebral cortex of a single hemisphere is
followed by a comparatively small loss of acuity. At 1000
cycles, the loss is only 2 to 5 db, which is a barely significant
loss (see Fig. 166 A). The loss is practically the same whether
the right or the left hemisphere is removed. Removal of the
entire cortex, however, produces an enormous loss of hearing,
of from 70 to 75 db (Fig. 166 B). It is of interest that the
conditioned auditory response can be elicited at all after total
removal of the cortex. This finding is contrary to a very
generally accepted belief that the cerebral cortex is necessary for
the performance of conditioned reflexes. Apparently the
cortex is not strictly necessary for conditioning, provided stimuli
of sufficient intensity are employed.

Destruction of one cochlea in an otherwise intact animal is
followed by a hearing loss of about 3 db (Fig. 166 C). This
corresponds quite well with human experience that binaural
listening is more sensitive than monaural (see p. 52).
Destruction of both cochleae naturally produces total deafness.

[Diagrams A to E. Panel labels: A, 2 to 5 db loss; B, 70 to 75 db loss; C, 3 db loss.]

Fig. 166. Diagrams showing the hearing loss which results from removal
of various portions of the auditory mechanism in dogs.
A: removal of one cerebral hemisphere.
B: removal of both cerebral hemispheres.
C: destruction of one cochlea.
D: removal of one cerebral hemisphere and destruction of the homolateral
cochlea. Hearing here depends upon the uncrossed fibers of the opposite
lateral lemniscus.
E: removal of one cerebral hemisphere and destruction of the contralateral
cochlea. Hearing here depends upon the crossed fibers of the right lateral
lemniscus. The amount of hearing loss is the same in D and E.

In an animal from which one cerebral hemisphere has
already been removed, the destruction of one cochlea causes
an additional drop of 10 db (Fig. 166 D). It is immaterial
whether the cochlea destroyed is on the same or opposite side
as the remaining cerebral hemisphere (Fig. 166 E). The
equivalence of the two ears, when only one cerebral hemisphere
is intact, shows that the crossed and the uncrossed central
connections are equivalent with respect to auditory acuity. The
loss of hearing is the same whether the impulses reach the
remaining cortical hemisphere by way of crossed or uncrossed
fibers. That there is an equivalence between crossed and
uncrossed fibers coincides with the conclusions derived from the
study of the nerve impulses in cats, but we have no assurance
that these conclusions can be carried over to man, in whom there
may be somewhat different proportions of crossed and uncrossed
fibers in the lateral lemniscus.

THE EFFECTS OF DAMAGE TO THE MEDIAL 
GENICULATE BODIES 

We have already seen that the arrangement of neurons in
the medial geniculate bodies and the auditory radiations is
notably systematic and orderly (cf. Fig. 162). The
arrangement apparently corresponds to the arrangement in the
auditory nerve and cochlear nucleus, for localized surgical lesions
cause differential loss of sensitivity for particular tones. Using
a Horsley-Clarke stereotaxic instrument, Ades, Mettler, and
Culler introduced an electrode through a small hole in the skull
and on through the cerebral hemisphere into the medial
geniculate body, where they produced a small localized lesion. No
significant damage was caused to other parts of the brain by
this operation. The cats had previously been conditioned to
respond to the faintest audible tone. After the operation,
hearing losses of as much as 20 db were found for particular
tones, and the tones for which acuity was diminished
corresponded systematically to the locations of the lesions. These
locations were checked by microscopic examination.

The results of these experiments demonstrate that a
particular faint tone excites a restricted region within the geniculates
and that the location of the excited region depends upon the
frequency of the tone. The several foci are disposed in a spiral
within the geniculates, as follows:

Frequency (cycles)    Location*
125                   Base (ventral side)
250                   Base (ventral side)
500                   Medial quadrant
1000                  Posterior quadrant
2000                  Lateral quadrant
4000                  Rostral quadrant
8000                  Dorsal region

The disposition of the auditory pathways in the geniculates
seems, therefore, to conform with the organization found
elsewhere in the auditory mechanism.

THE AUDITORY CORTEX 

The highest level of the nervous system is the cerebral
cortex. We can trace the auditory pathway from the cochlea
to a particular region near the Sylvian fissure of the temporal
lobe, but from there on the possible pathways and
interconnections are so numerous that none can be designated
specifically as auditory pathways.

Clinically, we can recognize disabilities of hearing related
to disease of the sense-organ, of the auditory nerve, and, more
rarely, of the lateral lemniscus or other higher pathways. The
usual disability manifests itself as a loss of acuity. Loss or
malfunction of portions of the cerebral cortex, on the other
hand, may give symptoms of quite another sort, such as aphasia,
amnesia, etc., in which the patient hears the sounds but fails
to associate their usual meanings with them. There may occur
.? Jtar aV Jtcnuvd') sv .toss nf jro.ea.aiqy inr 

words. Or the patient may suffer from auditory hallucinations,
and hear sounds or words for which there is no recognizable
counterpart in the external world. Such hallucinations may

* The locations for the two lowest tones (125 and 250 cycles) have not been
established as definitely as the locations for the higher tones.





sometimes be part of a more or less generalized spontaneous
cortical excitation, as in an auditory 'aura' preceding an epileptic
seizure.

Further description and analysis of such conditions lie
beyond the scope of this book, because at this cortical level new
difficulties and complexities appear. On the physiological side,
we can no longer trace synchronized nerve impulses, and on
the psychological side we are uncertain as to what aspect of
experience to choose in the search for discriminations which
may be correlated with differences in physiological activity. It
may be profitable, nevertheless, briefly to describe the electrical
phenomena of the cerebral cortex, for, whatever their
psychological correlates may be, they are closely related to the
activities of the sensory pathways.

ELECTRICAL ACTIVITY OF THE CORTEX 

The cerebral cortex is the seat of continuous, more or less
rhythmic, fluctuations of electric potential whose periods are
of the order of tenths of a second. The electrical waves are
much slower than the action potentials of nerve trunks, and
may or may not represent a different type of underlying
biochemical and biophysical activity. Much of the activity appears
to originate spontaneously within the central nervous system
and not to be conditioned directly by, or dependent upon,
incoming sensory impulses (see Jasper for a review of these
phenomena). The spontaneous background of activity may be
increased or, more commonly, decreased by sensory stimulation,
and the activity is profoundly modified by general internal
conditions, such as oxygen supply, anesthesia, and sleep. The
patterns of spontaneous activity are broadly similar in voltage,
wave form, and rhythm in all parts of the gray matter which
have been studied, but there are slight differences in pattern
which are more or less characteristic of particular anatomical
areas. These differences may be correlated with differences in
microscopic structure of the gray matter and with differences
in peripheral connections and physiological function, but their
significance is, as yet, purely empirical and poorly understood.





Stimulation of a sensory pathway usually depresses the
spontaneous activity in the corresponding cortical field with which
that pathway is most directly connected, and it may depress or
modify activity in other fields as well. In addition to these
general effects, there is, at the beginning of stimulation, a
well-marked electric response in the corresponding cortical field. In
order to see the response clearly we must either avoid the effects
of anesthetics entirely, in order not to depress the response, or
else anesthetize deeply enough to silence the background of
spontaneous activity. Action potentials resembling on-effects
are readily obtained in the optic area (area striata) and are
present, but not so well developed, in the auditory area. In the
cutaneous tactile area, the response to local contact with the
skin is quite precisely localized to a small part of the
sensorimotor cortex (Marshall, Woolsey, and Bard). If, in the ear,
we think of a pure tone as stimulating locally a part of the
basilar membrane, we might expect, by analogy, to find a
correspondingly localized response in the auditory area of the
cortex. However, such localization has not been directly
demonstrated. If a sustained increase or decrease of electrical
activity appears at all in response to a sustained tone, it seems to
include the entire auditory area (Kornmuller). There is, on
the other hand, a strong suggestion from the study of lesions
of the temporal lobe, which have caused deafness in man, that
perception of high tones depends upon the medial portion of
the transverse temporal convolution and that perception of low
tones is a function of the anterior and lateral portions of the
same convolution (Pfeifer).

The response of the auditory area to a single click, and to the
onset of a sustained tone, is a single electric wave, or action
potential, which requires at least 3 msec to reach its maximum
and which falls still more slowly. The surface of the cortex
becomes relatively more electropositive during the response.
The latency of this electric wave is at least 8 msec. The
frequency of a pure tone is not reproduced, but a succession of
sharp clicks, at frequencies up to 100 per second, may elicit a
corresponding series of small action potentials. At higher
frequencies the individual waves can no longer be detected.

All things considered, it is obvious that, until many more
data are available, it is futile to speculate as to the relation
between the electrical activity of the cortex and the
psychological phenomena of hearing.

THE PROBLEM OF PSYCHOPHYSIOLOGICAL
RELATIONS IN HEARING

Although we have found it impossible to trace the
physiological determinants of auditory sensation through the higher
pathways of the cortex, we have not failed in our effort to
discover the organic basis of many sensory discriminations. We
have noted, from time to time, that the ultimate form of
certain auditory responses is imposed by the nature of events in
the middle ear, in the cochlea, and in the auditory nerve, and
in some instances it has proved appropriate to relate
psychological with physiological functions. An attempt, at present,
to account for all psychological discriminations in terms of
physiological processes is obviously premature, and
explanations in this realm of psychophysiology must be cast in
speculative form. Although speculation may be hazardous for the
good repute of the speculator, it fulfills an important purpose
when it serves to give perspective to a field of inquiry, or when
it stimulates research designed to replace speculation by factual
demonstrations. It has been in this spirit that occasional
suggestions as to possible psychophysiological correlations have
been ventured.

Two principles must guide all our efforts to relate
psychological with physiological facts. (1) No discriminatory
reaction to an auditory stimulus is possible unless there exists, at
every stage of the auditory process, a differentiated pattern, one
of whose aspects provides a basis for the reaction in question.
On the other hand (2), the presence of a particular
differentiated pattern does not insure that a discriminatory reaction will
be possible, because differentiation at lower levels may become
lost at higher levels. Thus the presence, in the auditory nerve,
of impulses synchronized with the waves of the stimulus
probably does not furnish grounds for a subsequent discrimination,
for the synchronization is apparently lost at later synapses,
without there establishing a surrogate pattern whose differentiation
reflects the frequency-aspect of the previous pattern.

The first of these principles, the requirement of adequate
differentiation at each stage, does not mean that the crucial
aspect of the pattern of differentiation underlying a later
response will be obvious under our methods of analysis. It
frequently happens that a discrimination rests upon a particular
combination of one or more observable aspects of the earlier
pattern. For example, ordinary methods of analysis show that
a pure tonal stimulus can be adequately characterized in terms
of the two aspects, frequency and intensity. The four
discriminatory reactions which we call pitch, loudness, volume,
and density are each based upon some particular combination
of these two variables of the stimulus. The interaction of the
stimulus with the human organism establishes neural patterns
differentiated in at least four different ways, so that under
different attitudes, or Aufgaben, four different types of reaction
are possible. The attitude which we give an observer by way
of an instruction to attend to a particular aspect of his sensations
is of great consequence in this process, for it determines the
character of the ultimate differentiation quite as much as does
the stimulus itself. In this sense, the instruction, or the
self-instruction, accepted by the listener can be regarded as part of
the stimulus-pattern, although this part of the stimulus usually
enters by way of another modality, or at an earlier time, and is
not represented in the events which we record as electrical
effects in the sense-organ and nervous pathways.

These general principles are basic to any understanding of
the relations between the three aspects of the perceptual process
which we somewhat arbitrarily distinguish as stimulus, neural
activity, and response. Such principles must guide our future
explorations of this vast and important field of research, a field
in which our excursions have only just begun, but which holds
rich rewards for man's irrepressible curiosity to know why.



APPENDIXES 



APPENDIX I 

FORMULAS FOR MODULATION 

Amplitude Modulation. In the formula for a sinusoidal
wave,

$$ y = A \sin \omega t, \qquad (1) $$

where $\omega$ is the angular velocity and $t$ is time, let $A$, the
amplitude of the wave, vary with time in such a way that

$$ A = (1 + m \sin qt), \qquad (2) $$

where $m$ represents the amplitude of the modulation, and $q$ is
the angular velocity of the modulating wave. Then, with the
unmodulated amplitude taken as 1, substitution of (eq. 2) in (eq. 1) gives

$$ y = \sin \omega t + m \sin \omega t \sin qt. $$

Trigonometric reduction of this formula gives

$$ y = \sin \omega t + \frac{m}{2} \left[ \cos (\omega t - qt) - \cos (\omega t + qt) \right]. $$

Thus we see that there are present in a wave whose amplitude
is modulated sinusoidally three frequencies. They are the
original wave, or central band, and two side bands differing
from the central band by an amount equal to the frequency of
the modulation.
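
The trigonometric reduction can be checked numerically. The following is a minimal sketch in Python, not part of the original development: the function names and the sample values (a 1000-cycle wave modulated at 100 cycles with m = 0.5) are assumptions chosen only for illustration. It compares the modulated wave, computed directly, with the sum of the central band and the two side bands.

```python
import math


def am_direct(t, omega, q, m):
    """Amplitude-modulated wave: (1 + m sin qt) sin(omega t)."""
    return (1.0 + m * math.sin(q * t)) * math.sin(omega * t)


def am_three_components(t, omega, q, m):
    """Same wave written as central band plus lower and upper side bands."""
    central = math.sin(omega * t)
    lower = (m / 2.0) * math.cos((omega - q) * t)
    upper = -(m / 2.0) * math.cos((omega + q) * t)
    return central + lower + upper


def max_difference(omega, q, m, n_samples=2000, duration=0.05):
    """Largest absolute difference between the two forms over the sampled interval."""
    worst = 0.0
    for k in range(n_samples):
        t = duration * k / n_samples
        diff = abs(am_direct(t, omega, q, m) - am_three_components(t, omega, q, m))
        worst = max(worst, diff)
    return worst


if __name__ == "__main__":
    # Illustrative values only: 1000-cycle wave, 100-cycle modulation, m = 0.5.
    omega = 2.0 * math.pi * 1000.0
    q = 2.0 * math.pi * 100.0
    print(max_difference(omega, q, 0.5))
```

The difference stays at the level of rounding error, as the identity requires; the side bands lie at the central frequency minus and plus the frequency of modulation, each with half the amplitude of the modulation.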

Frequency Modulation. The following mathematical
development (Ramsdell) will show how the frequency-modulated
wave may be represented as a spectrum of sinusoidal waves
having constant amplitudes and frequencies. If the frequency
is not modulated (assuming a sinusoidal wave form), the
amplitude of the wave is given at any instant ($t$) by the equation

$$ y = A \sin \omega t \qquad (1) $$



where $A$ is the maximum amplitude and $\omega$ is the angular
velocity of the generating vector, whose length is $A$. In this case
the angular velocity is assumed to be constant. Frequency
modulation requires, however, that the angular velocity be
variable, and that the variability itself be a function of time.
Thus $y = A \sin \omega t$ becomes $y = A \sin \omega t$ with $\omega$ itself a
function of time. When the angular velocity, or its related
measure, frequency, is varied in such a manner that the
variation of frequency with time is sinusoidal, $\omega$ itself is a function
of time and may be represented as

$$ \omega = p + h \sin qt, $$

where $p$ = the initial value of $\omega$ in the equation for the
unmodulated wave, $h$ = amplitude of the frequency variation, and
$q$ = rate of frequency variation. The angle, $\theta$, through which
the vector, $A$, has rotated in time, $t$, may be expressed

$$ \theta = \int_0^t \omega \, dt = \int_0^t (p + h \sin qt) \, dt. $$

Thus

$$ y = A \sin \omega t $$

becomes

$$ y = A \sin \int_0^t (p + h \sin qt) \, dt \qquad (2) $$

$$ = A \sin \left[ pt - (h/q) \cos qt \right]_0^t $$

$$ = A \sin \left[ (pt + h/q) - (h/q) \cos qt \right]. \qquad (3) $$

In the trigonometric relation

$$ \sin (x - y) = \sin x \cos y - \cos x \sin y, $$

let

$$ x = (pt + h/q) \quad \text{and} \quad y = (h/q) \cos qt; $$

then (eq. 3) becomes

$$ y = A \left[ \sin (pt + h/q) \cos \left( (h/q) \cos qt \right) - \cos (pt + h/q) \sin \left( (h/q) \cos qt \right) \right]. \qquad (4) $$





From the Fourier developments:

$$ \cos (Z \cos \theta) = J_0(Z) - 2 J_2(Z) \cos 2\theta + 2 J_4(Z) \cos 4\theta - 2 J_6(Z) \cos 6\theta + \cdots $$

$$ \sin (Z \cos \theta) = 2 J_1(Z) \cos \theta - 2 J_3(Z) \cos 3\theta + 2 J_5(Z) \cos 5\theta - \cdots $$

in which $J_n(Z)$ is the Bessel function of the first kind and the
nth order for the argument $Z$.

Substitute in (eq. 4) these equations for the products of
trigonometric values, i.e., for $\cos ((h/q) \cos qt)$ in the first
term and $\sin ((h/q) \cos qt)$ in the second term, and it becomes:

$$ y = A \left[ \{ \sin (pt + h/q) \} \{ J_0(h/q) - 2 J_2(h/q) \cos 2qt + 2 J_4(h/q) \cos 4qt - \cdots \} - \{ \cos (pt + h/q) \} \{ 2 J_1(h/q) \cos qt - 2 J_3(h/q) \cos 3qt + \cdots \} \right]. \qquad (5) $$

$$ y = A \left[ J_0(h/q) \sin (pt + h/q) - 2 J_1(h/q) \cos qt \cos (pt + h/q) - 2 J_2(h/q) \cos 2qt \sin (pt + h/q) + 2 J_3(h/q) \cos 3qt \cos (pt + h/q) - \cdots \right]. \qquad (5a) $$

The products of trigonometric values may be expressed as sums:

$$ \cos x \cos y = \tfrac{1}{2} \cos (x + y) + \tfrac{1}{2} \cos (x - y) $$

$$ \sin x \cos y = \tfrac{1}{2} \sin (x + y) + \tfrac{1}{2} \sin (x - y), $$

which further simplifies (eq. 5a):

$$ y = A \{ J_0(h/q) \sin (pt + h/q) - J_1(h/q) \left[ \cos (pt + h/q + qt) + \cos (pt + h/q - qt) \right] - J_2(h/q) \left[ \sin (pt + h/q + 2qt) + \sin (pt + h/q - 2qt) \right] + \cdots \}. \qquad (6) $$

An examination of (eq. 6) shows that when $p$ is varied
sinusoidally by the amount $h$ at $q$ times per second, frequencies
are produced in addition to the one at $p$ and lie on both sides of
it. There are, mathematically, an infinite number of these
side bands, spaced by multiples of the rate of modulation ($p \pm nq$).





The relative amplitudes of the side bands are to each other as the
numbers which express the Bessel coefficients and are
determined by the ratio h/q. The smaller the value of the ratio, the
smaller are the amplitudes of the side bands.

The value of h/q is used in entering a table of Bessel
coefficients (Gray, Matthews, and MacRobert) in order to find
the amplitude of the central frequency and of the various side
bands. Examples of the sound spectra for different values of
h/q are shown in Fig. 167.
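
In place of a printed table, the Bessel coefficients can be computed directly. The sketch below is an illustration only and is not taken from the source: it evaluates $J_n$ from Bessel's integral, $J_n(z) = \frac{1}{\pi} \int_0^{\pi} \cos (n\tau - z \sin \tau) \, d\tau$, by the midpoint rule, and lists the relative amplitude of the central frequency (n = 0) and of the side bands at n times the rate of modulation on either side. The function names, the number of integration steps, and the sample ratios are assumptions.

```python
import math


def bessel_j(order, z, steps=2000):
    """Bessel function of the first kind, J_order(z), from Bessel's integral.

    Midpoint rule on (1/pi) * integral_0^pi cos(order*tau - z*sin(tau)) d tau.
    """
    total = 0.0
    for k in range(steps):
        tau = math.pi * (k + 0.5) / steps
        total += math.cos(order * tau - z * math.sin(tau))
    return total / steps


def side_band_amplitudes(ratio, max_order):
    """Relative amplitudes |J_n(ratio)| for n = -max_order .. max_order.

    `ratio` stands for h/q; n = 0 is the central frequency, and the n-th
    side bands lie n times the rate of modulation above and below it.
    """
    return {n: abs(bessel_j(abs(n), ratio)) for n in range(-max_order, max_order + 1)}


if __name__ == "__main__":
    # Illustrative ratios only.
    for ratio in (0.5, 1.0, 5.0):
        amps = side_band_amplitudes(ratio, 6)
        print(ratio, [round(amps[n], 4) for n in range(0, 7)])
```

For a small ratio nearly all of the amplitude remains in the central frequency; as the ratio grows, more side bands acquire appreciable amplitude. The squares of all the relative amplitudes sum to unity, which gives a convenient check on the computation.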


[Plots of the component amplitudes, one plot for each value of the ratio.]

Fig. 167. The relative amplitudes of the components produced by a
frequency modulation in which the ratio of range to rate is as indicated on each
plot. The components are spaced apart by a number of cycles equal to the
rate of modulation. For the modulations represented here, the rate is assumed
to be constant and the range variable.

The frequency modulation treated above is assumed to be
sinusoidal with time. Van der Pol (2) has developed a series
giving the coefficients (amplitudes) of the side bands in the
case of a 'square-topped' modulation, in which the frequency
alternates abruptly between two values.




APPENDIX II 


It has been customary to treat the mechanical properties of
the cochlea from the point of view of a series of tuned elements.
Thus Wegel and Lane proposed an electrical analogue of the
cochlea consisting of a row of series-resonant circuits connected
in parallel but separated by an appropriate inductance. Such a
circuit would simulate the properties of the cochlea, for the
basilar membrane behaves as if it consisted of a row of tuned
resonators. The behavior of Wegel and Lane's circuit could be
described by an appropriate set of differential equations.

It is also possible, however, to treat the cochlea from the
point of view of a hydraulic system contained in a vessel with
elastic walls, and to apply to it the theory of propagated
disturbances in media constrained by elastic boundaries. The
differential equations then take on a different form. Reboul set up
these differential equations on the basis of the assumption that
a disturbance is propagated along a fluid column contained in
a tube with elastic walls. The action of the cochlea would then
be as follows. Movement of the oval window (stapes) starts
a compression wave which travels toward the helicotrema. If
the wave is sufficiently sharp (high frequency) so that the
variations in pressure are not transmitted to the scala tympani
through the helicotrema (see p. 276), we can regard the tube in
which the propagation occurs as being the scala vestibuli.
This canal is bounded above by a solid wall and below by an
elastic wall, the basilar membrane. For a given frequency, one
part of this membrane will vibrate more than any other.
Furthermore, there will be a varying pressure gradient across the
membrane which, in general, will not be in phase with the
displacement of the membrane.

Then, if $y = f(x, t)$ represents the displacement of a point
in the medium as a function of the distance $x$ from the oval
window and of the time $t$, the velocity of the point is given by
$u = \partial y / \partial t$. The relations between the pressure $p$ and $x$ and $t$
are given by

$$ \frac{1}{c^2} \frac{\partial p}{\partial t} + \rho_0 \frac{\partial u}{\partial x} = 0 $$

$$ \rho_0 \frac{\partial u}{\partial t} + \frac{\partial p}{\partial x} = 0 $$

where $\rho_0$ is the density of the fluid and $c$ is the speed of
propagation of the pressure wave. The value of $c$ could be calculated
from the diameter of the cochlear canals and the elastic
constants of its walls and of the fluid within it. (The formulas for
dealing with this type of problem are used to calculate the speed
of pulse waves in arteries.)

The values of the various constants necessary for an exact
solution of these differential equations are not precisely known
for the cochlea, but, by making certain reasonable assumptions,
Reboul was able to show that the speed of propagation of the
pressure wave along the basilar membrane is of the order of
50 meters per second, which confirms the slow speed observed
experimentally (see p. 280). Furthermore, the basilar
membrane undergoes maximal displacement at a position which is a
function of frequency. The maximum is near the helicotrema
at low and near the oval window at high frequencies. In
addition to the maximal displacement, there is a maximal
pressure gradient across, or through, the membrane which also
occurs at different places for different frequencies. The
pressure gradient does not occur in the same phase and position as
the displacement.

It is conceivable, as Reboul points out, that the stimulating
factor for the end-organ (hair-cells) is the distortion produced
by this pressure gradient itself, rather than the distortion
produced by a displacement of the membrane. Experimental
evidence has not as yet decided this point (see, however, p. 343).

This point of view regarding the dynamics of the cochlea,
which considers the acoustic stimulus as a propagated
disturbance in an elastic tube, rather than as a forced vibration
impressed upon a set of resonators, has the additional advantage
that it enables us to see how the cochlea can behave as an
analyzer in spite of a large damping factor. A simple system
which is critically damped will not have a maximum in its
resonance-curve (cf. Fig. 3, p. 12, and see A. H. Davis, p. 15),
and a set of such systems could not, therefore, serve as an
analyzer. Hence, if we were to treat the cochlea as a set of
resonant systems, we should have to assume a damping less than
critical. No such restriction is necessary, however, when we
treat the cochlea as a hydrodynamic system in an elastic tube,
for a maximum of displacement of the basilar membrane can
be obtained in spite of a large damping factor. Thus it is
possible for the basilar membrane to act as an analyzer, and at
the same time show no free vibrations after a stimulus has
ceased.
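
The statement about the critically damped resonator can be illustrated with the textbook driven mass-and-spring system. The sketch below is an assumption-laden illustration and not a model of the cochlea or of Reboul's equations: it takes the standard steady-state amplitude of a single resonator, $1 / \sqrt{(1 - r^2)^2 + (2 \zeta r)^2}$, where $r$ is the driving frequency divided by the natural frequency and $\zeta$ is the damping ratio ($\zeta = 1$ being critical damping), and asks whether the resonance-curve ever rises above its value at zero frequency. The names `amplitude_ratio` and `has_resonance_peak` are hypothetical.

```python
import math


def amplitude_ratio(r, zeta):
    """Steady-state amplitude of a driven resonator, relative to its static deflection.

    r    : driving frequency divided by the undamped natural frequency
    zeta : damping ratio (1.0 corresponds to critical damping)
    """
    return 1.0 / math.sqrt((1.0 - r * r) ** 2 + (2.0 * zeta * r) ** 2)


def has_resonance_peak(zeta, r_max=3.0, steps=3000):
    """True if the resonance-curve exceeds its zero-frequency value anywhere in (0, r_max]."""
    static = amplitude_ratio(0.0, zeta)
    for k in range(1, steps + 1):
        r = r_max * k / steps
        if amplitude_ratio(r, zeta) > static * (1.0 + 1e-9):
            return True
    return False


if __name__ == "__main__":
    # Illustrative damping ratios only.
    for zeta in (0.05, 0.3, 1.0, 2.0):
        print(zeta, has_resonance_peak(zeta))
```

With light damping the curve shows a maximum near the natural frequency; at critical damping and beyond, the amplitude falls steadily as the frequency rises, so that a row of such elements could not single out one frequency from another.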





APPENDIX III 


The following table relates ratios to decibels. When we know the ratio
of two pressures or velocities in a plane progressive sound wave, or of two
currents or voltages operating in the same or equal impedances, we can find the
corresponding number of decibels by entering the table.

Power ratios can be converted into decibels by means of the simple rule
that the number of decibels corresponding to a given ratio of powers is
one-half of the number corresponding to the same ratio of voltages. In other
words, when dealing with power, find from the table the number of decibels
corresponding to the desired ratio and divide by 2.

For ratios outside the range of this table add 20 db for every tenfold
increase. (Courtesy General Radio Co.)
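The rules stated for the table can also be written as a short computation. This is a minimal sketch, assuming the ratios are taken in the same or equal impedances as the text requires; the function names are hypothetical.

```python
import math

def amplitude_ratio_to_db(ratio):
    """Decibels for a ratio of pressures, velocities, currents, or voltages
    (same or equal impedances assumed)."""
    return 20.0 * math.log10(ratio)

def power_ratio_to_db(ratio):
    """Decibels for a ratio of powers: one-half of the figure found
    for the same numerical ratio of voltages."""
    return amplitude_ratio_to_db(ratio) / 2.0

if __name__ == "__main__":
    print(amplitude_ratio_to_db(2.0))    # a voltage ratio of 2
    print(amplitude_ratio_to_db(20.0))   # tenfold larger ratio: 20 db more
    print(power_ratio_to_db(2.0))        # the same ratio taken as a power ratio
```

The second call illustrates the rule for ratios outside the range of the table: each tenfold increase of the ratio adds 20 db.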








GLOSSARY 


Absolute pitch. Absolute pitch refers to the ability possessed by certain
people to name the musical pitch of a note without the aid of a standard
of reference.

Acoustic impedance. The acoustic impedance of a sound medium
on a given surface lying in a wave front is the complex quotient of the
sound pressure (force per unit area) on that surface by the flux
(volume velocity, or linear velocity multiplied by the area) through the
surface. When concentrated rather than distributed impedances are
considered, the impedance of a portion of the medium is defined by
the complex quotient of the pressure-difference, effective in driving that
portion, by the flux (volume velocity). The acoustic impedance may
be expressed in terms of mechanical impedance: the acoustic impedance
is equal to the mechanical impedance divided by the square of the area
of the surface considered. The unit is the acoustic ohm.

Acoustic ohm. An acoustic resistance, reactance, or impedance is said
to have a magnitude of one acoustic ohm when a sound pressure of 1
dyne per square centimeter produces a volume velocity of 1 cc per sec.
Action potential. An action potential is the electric potential generated
between an active and an inactive region in a living tissue whenever
an element of the tissue (nerve or muscle fiber) is activated. The
energy generating the action potential is supplied by the metabolic
processes of the tissue and not by the stimulus to activation. The
action potential behaves in an all-or-none fashion. (Cf. distortion
potential.)

The distinction between action potential and action-current is
as follows:

Action-current is the current, derived from a living tissue, which
flows in an external circuit as a result of the electrochemical processes
associated with functional activity of the elements of the tissue.

Action potential is the difference of electric potential generated
between two points in a tissue as a result of its functional activity. The
action potential is measured by an instrument which draws a negligible
amount of current.

Attribute. A tonal attribute is an aspect of the sensation produced by a
tonal stimulus. Each attribute is defined by a differential reaction to
a tone by a listener under a particular set, or Aufgabe. Four aspects
can be distinguished in stimulation by pure tones. They are pitch,
loudness, volume, and density.

Audiogram. An audiogram is a graph expressing hearing loss as a
function of frequency.

Azimuth. The azimuth of a sound refers to the angular direction of
the source relative to the listener.

Beats. Beats are the periodic variations of the amplitude of the sound
pressure at a point due to the interference of two sound waves of different
frequencies.

Bel (b). The bel is the unit of a logarithmic scale expressing the ratio
of two amounts of power. The number of bels denoting such a ratio
is the logarithm to the base 10 of this ratio.

Combination tone. A combination tone is produced when two tones
act simultaneously on a nonlinear transducer. The combination tone
may have a frequency equal to the difference between the two tones or
any of their harmonics (difference tones), or it may have a frequency
equal to the sum of two tones or any of their harmonics (summation
tones).
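The frequencies at which combination tones may appear can be enumerated directly from this definition. The sketch below is illustrative only: the function name, the limit on harmonic order, and the example primaries of 1000 and 700 cycles are assumptions, and the code says nothing about which of these tones would actually be audible.

```python
def combination_tones(f1, f2, max_harmonic=2):
    """Return (difference_tones, summation_tones) for two primary
    frequencies in cycles per second, using harmonics of each primary
    up to max_harmonic (a hypothetical cut-off)."""
    difference, summation = set(), set()
    for m in range(1, max_harmonic + 1):
        for n in range(1, max_harmonic + 1):
            d = abs(m * f1 - n * f2)
            if d > 0:  # skip the degenerate case of coinciding harmonics
                difference.add(d)
            summation.add(m * f1 + n * f2)
    return sorted(difference), sorted(summation)

if __name__ == "__main__":
    diff, summ = combination_tones(1000, 700)
    print("difference tones:", diff)
    print("summation tones:", summ)
```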

Cycle (~). One complete set of the recurrent values of a periodic
quantity comprises a cycle. (Cf. frequency.)

Decibel. The decibel is one tenth of a bel. The number of decibels
denoting the ratio of two amounts of power is 10 times the logarithm
to the base 10 of this ratio. The abbreviation db is commonly used
for the term decibel.

When the conditions are such that ratios of currents or ratios of
voltages (or analogous quantities in other fields such as pressures,
amplitudes, or particle velocities in sound) are the square roots of the
corresponding power ratios, the number of decibels by which the
corresponding powers differ is expressed by the following formulas:

$n = 20 \log_{10} (I_1/I_2)$ db
$n = 20 \log_{10} (V_1/V_2)$ db

where $I_1/I_2$ and $V_1/V_2$ are the given current and voltage ratios
respectively.

By extension, these relations between numbers of decibels and ratios
of currents or voltages are sometimes applied where these ratios are not
the square roots of the corresponding power ratios but, to avoid
confusion, such usage should be accompanied by a specific statement of this
application. (See Appendix III for a table relating voltage ratios to
decibels.)





Density. Density is that aspect of auditory sensation in terms of which
sounds may be ordered on a scale running from ‘dense’ to ‘diffuse.’
The density of a tone increases with increased intensity and also with
increased frequency.

Dichotic stimulation. Dichotic stimulation refers to the simultaneous
stimulation of both ears, but with a different stimulus in each ear.

Difference-limen (DL). The difference limen is defined as the minimal
increment in a stimulus needed to produce a just noticeable difference
in sensation. The relative difference limen is the ratio of the DL
to the value of the stimulus to which it is added.

Distortion potential. A distortion potential is the electric potential
generated by the deformation of a living cell. The energy of the
distortion potential is supplied by the distorting force, and the distortion
potential does not behave in all-or-none fashion. (Cf. action
potential.)

Dyne per square centimeter. A dyne per square centimeter is the
unit of sound pressure. A dyne is defined as the force which will
produce a change of velocity of one centimeter per second in a gram
mass in one second.

Electrophonic effect. The electrophonic effect refers to the ability of
an alternating current, of suitable frequency and intensity, to arouse a
sensation of hearing when passed through a person’s head.

Equilibration. Equilibration refers to the process by which the activity
in a nerve subjected to repetitive stimulation achieves a steady state.
The initial burst of activity in a nerve, as measured by the action
potential, is greater than the final level of activity reached after
prolonged stimulation.

Forced vibration. A forced vibration is any vibration which is imposed
upon a system by an external force and whose frequency is
controlled thereby.

Fourier's theorem. Any function which, within an interval, is
single-valued, finite, and continuous may be represented by a series of
sinusoidal functions whose frequencies are in harmonic relation. The
application of this theorem is not limited to periodic functions.

Free wave (free progressive wave). A free wave is a sound wave
free from interference-effects.

Frequency. The number of cycles occurring per unit of time, or which
would occur per unit of time if all subsequent cycles were identical
with the cycle under consideration, is the frequency. The frequency
is the reciprocal of the period. The unit is the cycle per second. The
expression cycles per second is usually reduced to the single word cycles
wherever this usage is unambiguous.

Fundamental frequency. A fundamental frequency is the lowest
component frequency of a periodic wave or quantity.

Harmonic. A harmonic is a component of a periodic wave or quantity
having a frequency which is an integral multiple of the fundamental
frequency. For example, a component whose frequency is twice the
fundamental frequency is called the second harmonic.

An aural harmonic is a harmonic generated in the ear as a result
of nonlinearity and asymmetry in the auditory transducer.
Hearing loss. Hearing loss is measured as the number of decibels that
the intensity of a tone must be raised beyond the normal threshold
value for that tone, in order that a deafened ear may detect it.
The percentage of hearing loss at a given frequency is 100 times the
ratio of the hearing loss in decibels to the number of decibels between
the normal thresholds of audibility and of feeling at that frequency.
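The percentage rule can be illustrated with a small calculation. This is a sketch only: the function name is hypothetical, and the thresholds of 0 db and 120 db used in the example are made-up values chosen for round numbers, not figures taken from the text.

```python
def percent_hearing_loss(hearing_loss_db, audibility_db, feeling_db):
    """100 times the ratio of the hearing loss (db) to the number of
    decibels between the normal thresholds of audibility and of feeling
    at the same frequency."""
    span = feeling_db - audibility_db
    if span <= 0:
        raise ValueError("threshold of feeling must lie above threshold of audibility")
    return 100.0 * hearing_loss_db / span

if __name__ == "__main__":
    # Hypothetical case: a 30-db loss where the two thresholds lie 120 db apart.
    print(percent_hearing_loss(30.0, 0.0, 120.0))
```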
Intensity. Intensity refers to a dimension of a stimulus. It is a measure
of the strength or magnitude of the stimulating agent. In plane
progressive sound waves, intensity is usually measured in terms of pressure
or energy flow (power), but, whenever the power is not proportional
to the square of the pressure, energy flow alone should be taken as the
measure of intensity. (See sound intensity.)

Intensity level. The intensity level, in decibels, of a sound is 10 times
the logarithm to the base 10 of the ratio of the intensity $I$ of this sound
to the reference intensity $I_0$. In other words, intensity level is the
number of decibels that a sound is above the reference intensity.

Interference pattern. An interference pattern is the spatial distribution
of pressure, particle velocity, energy density, or energy flux which
occurs when sound waves of the same frequency are superposed.

Isophonic contours. An isophonic contour gives the values of frequency
and intensity, of a pure tone, which produce a sensation, one
of whose attributes has a constant value. Isophonic contours are plots
of frequency vs. intensity with the tonal attributes as parameters. The
isophonic contours comprise contours of equal pitch, equal loudness,
equal volume, and equal density.

Loudness. Loudness is that aspect of auditory sensation in terms of
which sounds may be ordered on a scale running from ‘soft’ to ‘loud.’
Loudness is chiefly a function of the intensity of a sound, but it is also
dependent upon the frequency and the composition. The unit is the
sone.





Loudness level. The loudness level of a sound is the intensity level of
a 1000-cycle tone which sounds equal to the sound in loudness.
Loudness-level is measured in decibels or phons above the reference intensity.
The 1000-cycle tone is the reference tone for loudness-comparisons, and
the loudness level of all other sounds is expressed in terms of the equally
loud reference tone.

Masking. Masking is defined as the number of decibels by which a
listener’s threshold of audibility for a given tone is raised by the presence
of another sound. The graph showing the elevation of the
threshold for various frequencies due to a masking sound is known as
the masking audiogram of that sound.

Mechanical impedance. The mechanical impedance of a system is
the complex quotient of the alternating force applied to the system by
the resulting alternating linear velocity in the direction of the force at
its point of application. The unit is the mechanical ohm or the
dyne-second per centimeter.

Mel. The mel is a unit of pitch. It is so defined that a 1000-cycle tone
40 db above threshold has a pitch of 1000 mels. (The mel is a so-called
‘subjective’ unit.)

Microphonic. A microphonic is the electric potential produced by a
transducer which converts vibratory into electrical energy. The
alternating potential produced by the cochlea in response to a stimulating
sound is an aural or cochlear microphonic.

Modulation. Any periodic alteration of a parameter of a vibratory
phenomenon produces modulation. Modulations of sound waves can be
produced by varying the frequency, the intensity, or the phase of the wave.

Natural frequency. The natural frequency of any system is the frequency
at which its vibrating element will vibrate after the external
force displacing it from its normal position has ceased to act. The
unit is the cycle per second.

Natural period. The natural period is the reciprocal of the natural
frequency. The unit is the second.

Neuron. A neuron is an entire nerve-cell, including cell body, axon,
and dendrites.

Octave. An octave is the interval between two frequencies having a
ratio of 2 to 1. One octave is equal to 1200 musical cents.
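Since an octave is a frequency ratio of 2 to 1 and contains 1200 cents, the size of any interval in cents follows from the logarithm to the base 2 of its frequency ratio. The sketch below assumes that standard logarithmic definition of the cent; the function name and the example frequencies are illustrative.

```python
import math

def interval_in_cents(f_low, f_high):
    """Size of the interval between two frequencies in musical cents,
    taking 1200 cents to the octave (frequency ratio 2 to 1)."""
    return 1200.0 * math.log2(f_high / f_low)

if __name__ == "__main__":
    print(interval_in_cents(440.0, 880.0))   # one octave
    print(interval_in_cents(200.0, 300.0))   # a ratio of 3 to 2
```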

Operating point. The operating point of an electrical or mechanical
system which is subject to alternating forces is its position or state when
no alternating force is applied. Under an alternating force the system
moves back and forth about its operating point.





Overtone. An overtone is a partial having a frequency higher than
that of the basic frequency.

Partial. A partial is a component of a complex tone. Its frequency
may be either higher or lower than that of the basic frequency and may
or may not bear an integral relation to the basic frequency.

Particle-velocity. The particle velocity in a sound wave is the
instantaneous velocity of a given infinitesimal part of the medium, with
reference to the medium as a whole, due to the passage of the sound wave.

Period (T). The time required for one cycle of a periodic quantity is
the period. The unit is the second.

Phase. The phase of a sound wave, at a given instant, is the part of
the cycle in which the wave finds itself at that instant, relative to some
arbitrary reference point. Phase is measured in degrees or radians.

Phon. The phon is a unit for measuring the loudness level of a tone.
The number of phons is equal to the number of decibels that a
1000-cycle tone is above the reference intensity when judged equal in
loudness to the tone in question.

Piezoelectricity. Piezoelectricity is the electricity or electric polarity
produced by pressure on an appropriate body. Certain crystallized
substances, such as quartz, commonly exhibit a piezo (pressure) effect.
Pitch. Pitch is that aspect of auditory sensation in terms of which
sounds may be ordered on a scale running from ‘low’ to ‘high.’ Pitch
is chiefly a function of the frequency of a sound, but it is also dependent
upon the intensity and the composition. The unit is the mel.

Pressure level. The pressure level, in decibels, of a sound is twenty
times the logarithm to the base 10 of the ratio of the pressure $P$ of this
sound to the reference pressure $P_0$. Unless otherwise specified, the
reference pressure is understood to be 0.0002 dyne per square
centimeter.

Pure tone. A pure tone is a sound produced by an instantaneous sound
pressure which is a simple sinusoidal function of time.

Reference-frequency. The reference frequency, or reference tone, for
loudness-comparisons is a tone of 1000 cycles per second.

Reference intensity ($I_0$). The reference intensity in acoustics is taken
as $10^{-16}$ watt per square centimeter. In a plane progressive sound
wave in air, this value corresponds to a root mean square pressure of
0.0002 dyne per square centimeter.
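The definitions of intensity level and pressure level, together with the reference values just given, can be put into a few lines of code. This is a minimal sketch; the constant and function names are hypothetical, and the inputs are assumed to be in watts per square centimeter and dynes per square centimeter (root mean square) respectively.

```python
import math

REFERENCE_INTENSITY = 1e-16  # watt per square centimeter
REFERENCE_PRESSURE = 0.0002  # dyne per square centimeter, root mean square

def intensity_level_db(intensity):
    """10 times the logarithm to the base 10 of intensity / reference intensity."""
    return 10.0 * math.log10(intensity / REFERENCE_INTENSITY)

def pressure_level_db(pressure):
    """20 times the logarithm to the base 10 of pressure / reference pressure."""
    return 20.0 * math.log10(pressure / REFERENCE_PRESSURE)

if __name__ == "__main__":
    print(intensity_level_db(1e-10))  # an intensity one million times the reference
    print(pressure_level_db(0.2))     # a pressure one thousand times the reference
```

A millionfold increase in intensity and a thousandfold increase in pressure give the same number of decibels, as expected when power is proportional to the square of the pressure.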

Refractory period. The refractory period is the period of time following
the excitation of a nerve or muscle fiber, during which the fiber is
either absolutely or relatively inexcitable.





Resonance (velocity resonance). Resonance exists between a body,
or system, and an applied sinusoidal force if any small change in the
frequency of the applied force causes a decrease in velocity at the
driving point, or if the frequency of the applied force is such that the
absolute value of the driving-point impedance is a minimum.

Resonant frequency. A resonant frequency is a frequency at which
resonance exists. The unit is the cycle per second.

Sensation level. The sensation level of a given sound is the number
of decibels that the sound is above its normal threshold of audibility.

Sone. The sone is a unit of loudness. It is defined as the loudness of
a 1000-cycle tone 40 db above threshold. (The sone is a so-called
‘subjective’ unit.) A millisone is one thousandth of a sone.

Sound. (a) Sound is an alteration in pressure, particle-displacement,
or particle velocity propagated in an elastic material, or the
superposition of such propagated alterations.

(b) Sound is also the sensation produced through the ear by the
alterations described above.

Sound-energy density (E). Sound-energy density is the sound-energy
per unit volume. The unit is the erg per cubic centimeter.

Sound-energy flux (J). The sound-energy flux is the average over
one period of the rate of flow of sound-energy through any specified
area. The unit is the erg per second.

Sound intensity (I). The sound intensity of a sound field in a specified
direction at a point is the sound-energy transmitted per unit of time in
the specified direction through a unit area normal to this direction at
the point. The unit is the erg per second per square centimeter, but
sound intensity may also be expressed in watts per square centimeter.

Sound pressure. The effective sound pressure at a point is the root
mean square value of the instantaneous sound pressure over a complete
cycle, at that point. The unit is the dyne per square centimeter.
Spectrum. A spectrum is the distribution of energy among the component
frequencies of a vibratory phenomenon. An acoustic spectrum
refers to the frequencies, intensities, and phases of the components of a
sound.

Subharmonic. A subharmonic is a component of a complex wave
having a frequency which is an integral submultiple of the basic
frequency.

Synapse. A synapse is the connection or region of contact between
two neurons. A nerve impulse progressing along the axon of one
nerve-cell must cross a synapse before reaching the dendrite or cell body
of the next nerve-cell. The transmission of impulses across synapses
is subject to different laws from those governing transmission in nerve
fibers.

Synchronized action potentials. Action potentials in different nerve
fibers are said to be synchronized with one another when they arrive at
a given point in the nerve within an interval of time which is short
relative to the duration of the action potentials. They are synchronized
with sound waves when they are initiated in a definite phase relation
with the sound waves.

Thermal noise. A thermal noise is the noise produced by the random
vibration of the molecules of the air due to thermal agitation. Thermal
noise is also produced when the potential due to the random agitation
of electrons in an electrical conductor is amplified and impressed on a
loud speaker. In a thermal noise all frequencies are present and the
spectrum of the sound is continuous.

Threshold of audibility. The threshold of audibility at any specified
frequency is the minimal value of sound pressure which produces a
tonal sensation. In specifying the threshold value of sound pressure,
the point at which the pressure is measured must be stated.

Transducer. A transducer is a device by means of which energy may
flow from one or more transmission systems to one or more other
transmission systems. A microphone is a transducer, because by means of
it energy flows from an acoustic into an electrical system. Likewise
the ear is a transducer.

Utilization time. Utilization time is the minimal duration in which a
stimulus of given intensity is effective in initiating a nerve impulse.

Vibrato. The vibrato is a musical embellishment consisting of a rapid
rise and fall in the frequency (or intensity) of a note at the rate of
about seven fluctuations per second.

Volume. Volume is that aspect of auditory sensation in terms of which
sounds may be ordered on a scale running from ‘small’ to ‘large.’ The
volume of a tone increases with increased intensity but decreases with
increased frequency.

The word ‘volume’ is commonly used by radio engineers to refer to
the intensity of a sound and should not be confused with volume as
defined above.

Wave-length (λ). The wave length of a periodic wave in an isotropic
medium is the perpendicular distance between two wave fronts in
which the displacements have a phase-difference of one complete cycle.



REFERENCES 


Note. The following references are listed alphabetically by authors.
Where more than one title follows the name of a single author, the titles
are numbered. These are the numbers which appear in the text following
the name of the author.

Abraham, O. Zur psychologischen Akustik von Wellenlänge und
Schwingungszahl. Zsch. f. Sinnesphysiol., 1920, 51, 121-152.

Ades, H. W., Mettler, F. A., and Culler, E. A. Amer. J. Physiol. (in press,
1938).

Adrian, E. D. The microphonic action of the cochlea: an interpretation of
Wever and Bray's experiments. J. Physiol., 1931, 71, xxviii-xxx.

Adrian, E. D., Bronk, D. W., and Phillips, G. The nervous origin of the
Wever and Bray effect. J. Physiol., 1931, 73, 2P-4P.

American Otological Society. Symposium: Hearing by bone conduction.
Ann. Otol., Rhinol. and Laryngol., 1936, 45, 735-864.

Andrade, E. N. da C. On the circulations caused by the vibration of air in
a tube. Proc. Roy. Soc. London, 1931, 134A, 445-470.

Andreef, A. M., Gersuni, G. V., and Volokhov, A. A. On the electrical
excitability of the human ear. On the effect of alternating currents on
the affected auditory apparatus. J. Physiol. USSR, 1935, 18, 250-265.

Angell, J. R., and Fite, W. The monaural localization of sound. Psychol.
Rev., 1901, 8, 225-246.

Ashcroft, D. W., and Hallpike, C. S. On the function of the saccule. J.
Laryngol. and Otol., 1934, 49, 1-11.

Ashcroft, D. W., Hallpike, C. S., and Rawdon-Smith, A. F. On the changes
in histological structure and electrical response of the cochlea of the
cat following section of the VIIIth nerve. Proc. Roy. Soc. London,
1937, 122B, 186-197.

Bachem, A. Various types of absolute pitch. J. Acous. Soc. Amer., 1937, 9,
146-151.

Baier, D. E. The loudness of complex sounds. J. Exper. Psychol., 1936, 19,
280-307.

Banister, H. Audition: I. Auditory phenomena and their stimulus
correlates. In A handbook of general experimental psychology, ed. by
C. Murchison. Worcester: Clark Univ. Press, 1934. Pp. 880-923.

Barnes, R. B., and Czerny, M. Lässt sich ein Schroteffekt der Photonen mit
dem Auge beobachten? Zsch. f. Physik, 1932, 79, 436-449.

Barron, D. H., and Matthews, B. H. C. Intermittent conduction in the spinal
cord. J. Physiol., 1935, 85, 73-103.

Barton, E. H. A textbook on sound. London: Macmillan, 1914. (See esp.
p. 9.)

Bast, T. H., and Eyster, J. A. E. In Symposium on tone localization in the
cochlea. Ann. Otol., Rhinol. and Laryngol., 1935, 44, 792-803.

Beatty, R. T. Hearing in man and animals. London: Bell, 1932.
Békésy, G. v. (1) Über den Einfluss der nichtlinearen Eisenverzerrungen auf
die Güte und Verständlichkeit eines Telephonie-Übertragungssystemes.
Elek. Nachr.-Techn., 1928, 5, 231-246.

(2) Zur Theorie des Hörens. Die Schwingungsform der Basilarmembran.
Physik. Zsch., 1928, 29, 793-810.

(3) Zur Theorie des Hörens. Über die Bestimmung des einem reinen
Tonempfinden entsprechenden Erregungsgebietes der Basilarmembran
vermittels Ermüdungserscheinungen. Physik. Zsch., 1929, 30, 115-125.

(4) Zur Theorie des Hörens. Über die eben merkbare Amplituden- und
Frequenzänderung eines Tones. Die Theorie der Schwebungen. Physik.
Zsch., 1929, 30, 721-745.

(5) Über das Richtungshören bei einer Zeitdifferenz oder
Lautstärkenungleichheit der beiderseitigen Schalleinwirkungen. Physik. Zsch.,
1930, 31, 824-835, 857-868.

(6) Über das Fechnersche Gesetz und seine Bedeutung für die Theorie
der akustischen Beobachtungsfehler und die Theorie des Hörens.
Ann. d. Physik, 1930, 7, 329-359.

(7) Sur la théorie de l'audition. Ann. de psychol., 1930, 31, 63-96.

(8) Über die Messung der Schwingungsamplitude fester Körper. Ann. d.
Physik, 1931, 11, 227-232.

(9) Bemerkung zur Theorie der günstigsten Nachhalldauer von Räumen.
Ann. d. Physik, 1931, 8, 851-873.

(10) Über die Ausbreitung der Schallwellen in anisotropen dünnen Platten.
Zsch. f. Physik, 1932, 79, 668-671.

(11) Über den Einfluss der durch den Kopf und den Gehörgang bewirkten
Schallfeldverzerrungen auf die Hörschwelle. Ann. d. Physik, 1932,
14, 51-56.

(12) Zur Theorie des Hörens bei der Schallaufnahme durch Knochenleitung.
Ann. d. Physik, 1932, 13, 111-136.

(13) Über die Schallfeldverzerrungen in der Nähe von absorbierenden
Flächen und ihre Bedeutung für die Raumakustik. Zsch. f. techn.
Physik, 1933, 14, 6-10.

(14) Über die Hörsamkeit der Ein- und Ausschwingvorgänge mit
Berücksichtigung der Raumakustik. Ann. d. Physik, 1933, 16, 844-860.

(15) Über den Knall und die Theorie des Hörens. Physik. Zsch., 1933, 34,
577-582.

(16) Über die Hörsamkeit kleiner Musikräume. Ann. d. Physik, 1934, 20,
665-679.

(17) Über die nichtlinearen Verzerrungen des Ohres. Ann. d. Physik,
1934, 20, 809-827.

(18) Über die Hörsamkeit von Konzert- und Rundfunksälen. Elek.
Nachr.-Techn., 1934, 11, 369-375.

(19) Physikalische Probleme der Hörphysiologie. Elek. Nachr.-Techn., 1935,
12, 71-83.

(20) Über akustische Reizung des Vestibularapparates. Pflügers Arch. f. d.
ges. Physiol., 1935, 236, 59-76.

(21) Über akustische Rauhigkeit. Zsch. f. techn. Physik, 1935, 16, 276-282.

(22) Über die Hörschwelle und Fühlgrenze langsamer sinusförmiger
Luftdruckschwankungen. Ann. d. Physik, 1936, 26, 554-566.

(23) Fortschritte der Hörphysiologie. Zsch. f. techn. Physik, 1936, 12,
522-528.

(24) Über die Herstellung und Messung langsamer sinusförmiger
Luftdruckschwankungen. Ann. d. Physik, 1936, 25, 413-432.

(25) Zur Physik des Mittelohres und über das Hören bei fehlerhaftem
Trommelfell. Akust. Zsch., 1936, 1, 13-23.

Boring, E. G. (1) Auditory theory with special reference to intensity, volume
and localization. Amer. J. Psychol., 1926, 37, 157-188.

(2) A history of psychology. New York: Appleton-Century, 1929. (See
esp. p. 281.)

(3) The physical dimensions of consciousness. New York:
Appleton-Century, 1933.

(4) The relation of the attributes of sensation to the dimensions of stimulus.
Phil. Sci., 1935, 2, 236-245.

Boring, E. G., and Stevens, S. S. The nature of tonal brightness. Proc. Nat.
Acad. Sci., 1936, 22, 514-521.

Boring, E. G., and Titchener, E. B. Sir Thomas Wrightson's theory of
hearing. Amer. J. Psychol., 1920, 31, 101-113.

Bowen, R. E. The cupula of the membranous labyrinth. J. Comp. Neurol.,
1933, 58, 517-539.

Brecher, G. A. Die untere Hör- und Tongrenze. Pflügers Arch. f. d. ges.
Physiol., 1934, 234, 380-393.

Brogden, W. J., Girden, E., Mettler, F. A., and Culler, E. Acoustic value of
the several components of the auditory system in cats. Amer. J. Physiol.,
1936, 116, 252-261.

Bronstein, A. J. Sensibilization of the auditory organ by acoustic stimuli.
Bull. de biol. et de méd. expér., 1936, 1, 274-275, 276-277; 2, 347-349.

Bronstein, A. J., and Chwulova, E. A. The dependence of time taken to
restore the initial excitability of the hearing apparatus upon the pitch
of the acting tone. Bull. de biol. et de méd. expér., 1936, 1, 428-436.

Bunch, C. C. Age variations in auditory acuity. Arch. Otolaryngol., 1929, 9,
625-636.

Bürck, W., Kotowski, P., and Lichte, H. (1) Die Lautstärke von Knacken,
Geräuschen und Tönen. Elek. Nachr.-Techn., 1935, 12, 278-288.

(2) Der Aufbau des Tonhöhenbewusstseins. Elek. Nachr.-Techn., 1935, 12,
326-333.

(3) Die Hörbarkeit von Laufzeitdifferenzen. Elek. Nachr.-Techn., 1935,
12, 355-367.

(4) Die Hörbarkeit von Knacken und kurzdauernden Tönen. Zsch. f.
techn. Physik, 1935, 12, 516-519.

(5) Ausgleichsvorgänge in elektroakustischen Übertragungsanlagen. Zsch.
f. techn. Physik, 1935, 12, 519-522.

(6) Hörbarkeit von Regelvorgängen in dynamikgeregelten Verstärkern und
Film-Klartonsystemen. Zsch. f. techn. Physik, 1935, 12, 522-525.

(7) Frequenzspektrum und Tonerkennen. Ann. d. Physik, 1936, 25,
433-449.

(8) Logarithmische und lineare Lautstärkenskala. Ann. d. Physik, 1936, 27,
664-668.

(9) Die Lautstärke von Knackfolgen. Hochfrequenztechn. u.
Elektroakustik, 1936, 47, 33-37.

(10) Dynamikgeregelte Verstärker und Klartonsteuerungen. Elek.
Nachr.-Techn., 1936, 13, 47-73.

(11) Die Hallwirkung von Räumen, ihre Messung und ihre Nachbildung.
Elek. Nachr.-Techn., 1936, 13, 268-280.

Buytendijk, F. J. J. On the negative variation of the nervus acusticus caused
by a sound. Proc. Roy. Soc. Amsterdam, 1910, 13, 649-652.

Cannon, W. B., and Rosenblueth, A. Autonomic neuro-effector systems.
(Exper. Biol. Monog.) New York: Macmillan, 1937.

Chapin, E. K., and Firestone, F. A. The influence of phase on tone quality
and loudness; the interference of subjective harmonics. J. Acous. Soc.
Amer., 1934, 5, 173-180.

Churcher, B. G. A loudness scale for industrial noise measurements. J.
Acous. Soc. Amer., 1935, 6, 216-226.

Churcher, B. G., and King, A. J. The performance of noise meters in terms
of the primary standard. J. Instit. Elec. Eng., 1937, 81, 57-90.

Churcher, B. G., King, A. J., and Davies, H. The minimum perceptible change
of intensity of a pure tone. Phil. Mag., 1934, 18, 927-939.

Ciocco, A. A statistical approach to the problem of tone localization in the
human cochlea. Human Biol., 1934, 6, 714-721.

Covell, W. P., and Black, L. J. The cochlear response as an index to hearing.
Amer. J. Physiol., 1936, 116, 524-530.

Crowe, S. J. Anatomic changes in the labyrinth secondary to cerebellopontile
and brain stem tumors. Arch. Surg., 1929, 18, 982-991.

Crowe, S. J., Guild, S. R., and Polvogt, L. M. Observations on the pathology
of high tone deafness. Bull. Johns Hopkins Hosp., 1934, 54, 315-379.

Culler, E. A. In Symposium on tone localization in the cochlea. Ann. Otol.,
Rhinol. and Laryngol., 1935, 44, 809-815.

Culler, E. A., Willmann, J., and Mettler, F. A. Mapping the cochlea. Amer.
J. Physiol., 1937, 119, 292.

Dahmann, H. Zur Physiologie des Hörens; experimentelle Untersuchungen
über die Mechanik der Gehörknöchelchenkette, sowie über deren
Verhalten auf Ton und Luftdruck. I. Zsch. f. Hals-, Nasen- u.
Ohrenheilkunde, 1929, 24, 462-497.

Datta, A. C. A textbook of sound. London: Blackie, 1932 (?).

Davis, A. H. Modern acoustics. New York: Macmillan, 1934.

Davis, H. (1) Audition: III. The physiological phenomena of audition. In
A handbook of general experimental psychology, ed. by C. Murchison.
Worcester: Clark Univ. Press, 1934. Pp. 962-986.

(2) The electrical phenomena of the cochlea and the auditory nerve.
J. Acous. Soc. Amer., 1935, 6, 205-215.

Davis, H., and Derbyshire, A. J. The mechanism of auditory masking.
Amer. J. Physiol., 1935, 113, 34.

Davis, H., Derbyshire, A. J., Kemp, E. H., Lurie, M. H., and Upton, M.
Functional and histological changes in the cochlea of the guinea pig
resulting from prolonged stimulation. J. Gen. Psychol., 1935, 12,
251-278.





Davis, H., Derbyshire, A. J., and Lurie, M. H. A modification of auditory
theory. Arch. Otolaryngol., 1934, 20, 390-395.

Davis, IL, Derbyshire, A. J, Lurie, M. H, and Saul, L. J The electric 
response of the cochlea Amer J PhysioL, 1934, 107, 311-332 

Dean, C. E. Audiuon by bone conduction. J Acous Soc. Amer, 1930, 2, 

281-297 

Derbyshire, A. J Action potentials of the auditory nerve. Thesis, Harvard 
University, 1934 

Derbyshire, A. J, and Davis, H. (1) The probable mechanism for stunula 
tion of the auditory nerve by the organ of Com. Amer J Physiol, 

1935,113,35 

(2) The action potentials of the auditory nerve. Amer J Physiol, 1935, 
113,476-504 

Drysdale, C. V Discussion on audition. Phys. Soc, 1931, 62-78 

Einthoven, W Sur les phenomcnes eiectnques du tonus musculaire. Arch. 
neerL de physiol, 1918, 2, 4S9-499 

Ekdahl, A. G, and Boring, E G The pitch of tonal masses. Amer J 
Psychol, 1934, 46, 452-455 

Ekdahl, A. G, and Stevens, S S The relation of pitch to the duration of a 
tone. (Unpublished, 1937) 

Erlancer, J, and Gasser, H. S Electrical signs of nervous activity Phila 
delplua Umv Pa Press, 1937 

Ewing, A. W G, and Little*, T S Auditory fatigue and adaptation Bnt. 
J Psychol, 1935,25,284-307 

Fay, R. D Plane sound waves of finite amplitude. J Acous Soc. Amer, 
1931, 3, 222-241 

Finch, G, and Culler, E. Effects of protracted exposure to a loud tone. 
Science, 1934, 80, 41-42. 

Firestone, F A. A new analogy between mechanical and electrical systems 
J Acous Soc. Amer, 1933, 4, 249-267 

Fletcher, U- (1) Speech and hearing New York Van Nostrand, 1929 

(2) A space-time pattern theory of hearing J Acous Soc. Amer, 1930, 1, 
311-343 

(3) Loudness, pitch and the timbre of musical tones and thar relation to 
the intensity, the frequency and the overtone structure. J Acous Soc. 
Amer, 1934, 6, 59-69 

(4) Newer concepts of the pitch, the loudness and the timbre of musical 
tones. J Franklin Instit, 1935, 220, 405-429 

Fletcher, H, and Munson, W A. (1) Loudness, its definition, measurement,
and calculation J Acous Soc Amer, 1933, 5, 82-108

(2) Relation between loudness and masking J Acous Soc Amer, 1937,
9, 1-10 

Fletcher, H, and Wegel, R. L. The frequency-sensitivity of normal ears
Phys Rev, 1922, 2nd ser, 19, 553-565 

Forbes, A, Miller, R. H, and O'Connor, J Electric responses to acoustic
stimuli in the decerebrate animal Amer J Physiol, 1927, 80, 363-380

Frederick, H. A. American tentative standard acoustical terminology J 
Acous. Soc Amer, 1937, 9, 60-71





Gage, F H (1) The measurability of auditory sensations Proc. Roy Soc. 
London, 1934, 116B, 103-119 

(2) The variation of the uniaural differential threshold with simultaneous
stimulation of the other ear by tones of the same frequency Brit. J
Psychol, 1935, 25, 458-464

Galt, R H Methods and apparatus for measuring the noise audiogram J 
Acous Soc Amer , 1929, 1, 147-157 

Geffcken, W Untersuchungen über akustische Schwellenwerte III Über
die Bestimmung der Reizschwelle der Hörempfindung aus Schwellendruck
und Trommelfellimpedanz Ann d Physik, 1934, 19, 829-848

Geiger, P H, and Firestone, F A The estimation of fractional loudness
J Acous Soc Amer , 1933, 5, 25-30 

Gersuni, G V, and Volokhov, A A On the electrical excitability of the
auditory organ on the effect of alternating currents in the normal
auditory apparatus J Exper Psychol, 1936, 19, 370-382.

Gray, A, Matthews, G B , and MacRobert, T M Treatise on Bessel func- 
tions London Macmillan, 1922 

Guild, S R (1) War deafness and its prevention: report of the labyrinths
of the animals used in testing of preventive measures J Lab and Clin.
Med, 1919, 4, 153-180

(2) Correlations of histologic observations and the acuity of hearing Acta 
Oto-Laryngol, 1932, 17, 207-249 

(3) Hearing by bone conduction the pathways of transmission of sound 
Ann Otol, Rhinol and Laryngol, 1936, 45, 736-753

Guilford, J P Psychometric methods New York McGraw Hill, 1936 

Gundlach, R H Tonal attributes and frequency theories of hearing J 
Exper Psychol, 1929, 12, 187-196 

Gundlach, R H, and Bentley, M The dependence of tonal attributes upon
phase Amer J Psychol , 1930, 42, 519-543 

Guttman, J , and Barrera, S E The electrical potentials of the cochlea 
and auditory nerve in relation to hearing Amer J Physiol, 1937, 120,
666-670 

Guttman, J , and Ham, L Masking effects of an interfering tone upon a 
deafened ear J Acous Soc Amer, 1930, 2, 83-94 

Hall, H H (1) A recording analyzer for the audible frequency range 
J Acous Soc Amer, 1935, 7, 102-110

(2) Analysis of sound-waves J Soc. Motion Picture Eng , 1936, 27, 396- 
408 

Hallpike, C S (1) The precise anatomy of the Hensen's cells in the cochlea
of the guinea pig and its physiological significance J Physiol , 1931, 
73, 8P 

(2) On the function of the tympanic muscles J Laryngol and Otol,
1935, 50, 1-5 

Hallpike, C S, and Hartridge, H Electrical stimulation of the human
cochlea Nature, 1937, 139, 192 

Hallpike, C S, Hartridge, H, and Rawdon-Smith, A F On the electrical
responses of the cochlea and the auditory tract of the cat to a phase
reversal produced in a continuous musical tone. Proc. Roy Soc. London,
1937, 122B, 175-185

Hallpike, C S, and Rawdon-Smith, A F (1) The function of the tensor
tympani muscle. J Physiol, 1934, 81, 25P-27P 

(2) The Wever and Bray phenomenon A study of the electrical response 
in the cochlea with especial reference to its origin J Physiol , 1934, 81, 
395-408 

(3) The origin of the Wever and Bray phenomenon J Physiol., 1934, 83, 
243-254 

(4) The Wever and Bray phenomenon: A summary of the data concerning
the origin of the cochlear effect Ann Otol, Rhinol and Laryngol,
1937, 46, 976-990

Halverson, H M (1) Binaural localization of tones as dependent upon 
differences of phase and intensity Amer J Psychol, 1922, 33, 178-212

(2) Diotic tonal volumes as a function of differences of phase Amer J 
Psychol, 1922, 33, 526-534 

(3) Tonal volume as a function of intensity Amer J Psychol , 1924, 35, 
360-367 

(4) The upper limit of auditory localization Amer J Psychol , 1927, 38, 
97-106 

Ham, L. B, and Parkinson, J S Loudness and intensity relations J Acous 
Soc. Amer, 1932, 3, 511-534 

Hardy, M Observations on the innervation of the macula sacculi in man
Anat. Rec, 1934, 59, 403 

Hartley, R V L. Function of phase difference in binaural localization of 
pure tones Phys Rev, 1919, 13, 373-385 

Hartley, R. V L, and Fry, T C The binaural location of pure tones
Phys Rev, 1921, 2nd ser, 18,431-442 

Hartridge, H Effect of phase change on the human ear J Physiol, 1936, 
86, 64-65P 

Hartshorn, L The audible effect of a sudden change of phase in the current
supplied to a telephone receiver Phys Soc. Proc , 1937, 49, 194-197 

Helmholtz, H L. F Sensations of tone. (Trans by A J Ellis) New
York Longmans, Green, 1930 

Henney, K Principles of radio (3rd ed ) New York Wiley, 1938 

Hill, A V Chemical wave transmission in nerve Cambridge Univ Press
New York Macmillan, 1932 

Hollinshead, M T A study of the vibrato in artistic violin playing Univ
Iowa Stud, 1932, 1, 281-288

Holway, A H, and Upton, M On the psychophysics of hearing III The
locus of the stimulus threshold Proc Nat Acad Sci , 1938, 24 (in 
press) 

Horton, G P A quantitative study of hearing in the guinea pig (Cavia
cobaya) J Comp Psychol, 1933, 15, 59-73

Howe, H A, and Guild, S R. Absence of the organ of Corti and its possible 
relation to electric auditory nerve responses Anat Rec, 1933, 55 
(suppl to No 4), 20-21 





Hughes, J W The monaural threshold effect of a subliminal contralateral
stimulus Proc Roy Soc London, 1938, 124B, 406-420

Hughson, W, and Crowe, S J Immobilization of the round window membrane
a further experimental study Ann Otol, Rhinol and Laryngol,
1932, 41, 332-349

Hughson, W, and Witting, E G An objective study of auditory fatigue 
Acta Oto-Laryngol , 1935, 21, 457-486 

Hunt, F V A direct reading frequency meter suitable for high speed recording
Rev Scient. Instr, 1935, 6, 43-46

Janovsky, W Über die Hörbarkeit von Verzerrungen Elek Nachr.-Techn,
1929, 6, 421-439 

Jasper, H H Electrical signs of cortical activity Psychol Bull, 1937, 34, 
411-481 

Kemp, E H A critical review of experiments on the problem of stimulation 
deafness Psychol Bull , 1935, 32, 325-342 

Kemp, E H, and Coppée, G The latency of electric responses in the auditory
tracts of the brain stem Amer J Physiol, 1936, 116, 91-92

Kemp, E H, Coppée, G E, and Robinson, E H Electric responses of the
brain stem to unilateral auditory stimulation Amer J Physiol, 1937, 
120, 304-315 

Kemp, E H, and Robinson, E H Electric responses of the brain stem to
bilateral auditory stimulation Amer J Physiol, 1937, 120, 316-322 

Knauss, H P An empirical formula for the loudness of a 1000-cycle tone 
J Acous Soc Amer , 1937, 9, 45-46 

Knudsen, V O (1) The sensibility of the ear to small differences in intensity
and frequency Phys Rev, 1923, 21, 84-103

(2) Architectural acoustics New York Wiley, 1932. 

Knudsen, V O, and Jones, I H Bone conduction Arch Otolaryngol,
1931, 13, 489-505 

Kobrak, H Zur Physiologie der Binnenmuskeln des Ohres (Untersuchungen
zur Mechanik der Schalleitungskette) Passow-Schaeffer Beitr,
1930, 28, 138

Kock, W E (1) On the principle of uncertainty in sound J Acous Soc.
Amer, 1935, 7, 56-58 

(2) Certain subjective phenomena accompanying a frequency vibrato J 
Acous Soc Amer, 1936, 8, 23-25 

(3) A new interpretation of the results of experiments on the differential 
pitch sensitivity of the ear J Acous Soc Amer , 1937, 9, 129-134 

Koenig, R. Quelques expériences d'acoustique. Paris Lahure, 1882

Kornmüller, A E Die bioelektrischen Erscheinungen der Hirnrindenfelder

Krainz, W Das Knochenleitungsproblem. Experimentelle Ergebnisse Zsch.
f Hals- Nasen- u Ohrenheilkunde, 1926, 15, 306-313

Kucharski, P Recherches sur l'excitabilité auditive en fonction du temps
Ann. de psychol, 1928, 28, 1-74

Lane, C E Minimum sound energy for audition for tones of high frequency
Phys Rev, 1922, 2nd ser, 19, 492-497 





Lapicque, L. The chronaxic switching in the nervous system Science, 1929, 
70, 151-154 

Lewis, D , and Cowan, M The influence of intensity on the pitch of violin 
and ’cello tones J Acous Soc Amer , 1936, 8, 20-22 
Lewis, D, and Larsen, M I The cancellation, reinforcement and measurement
of subjective tones Nat Acad Sci, 1937, 23, 415-421
Lewis, D, and Reger, S N An experimental study of the role of the
tympanic membrane and the ossicles in the hearing of certain subjective
tones J Acous Soc. Amer, 1933, 5, 153-158
Lichte, H Physik der Geräusche. Vortrag, gehalten in der physikalischen
Gesellschaft, Zürich am 18 Mai 1936

Lifshitz, S Apparent duration of sound perception and musical optimum 
reverberation J Acous Soc Amer, 1936, 7, 213-221 
Lorente de Nó, R. (1) The reflex contractions of the muscles of the middle
ear as a hearing test in experimental animals Trans Amer Laryngol,
Rhinol and Otol Soc, 1933, 26-42

(2) Anatomy of the eighth nerve The central projection of the nerve 
endings of the internal ear Laryngoscope, 1933, 43, 1-38 

(3) The synaptic delay of the motoneurones Amer J Physiol, 1935, 111,
272-282. 

Lorente de Nó, R, and Harris, A S Experimental studies in hearing I
The threshold of the reflexes of the muscles of the middle ear II The
hearing loss after extirpation of the tympanic membrane. Laryngoscope, 
1933, 43, 315-326 

Lurie, M H (1) Animal experimentation on hearing its clues to the prevention
of deafness Trans Amer Acad Ophthalmol and Otolaryngol,
1935 

(2) How does the organ of Corti distinguish pitch? Ann Otol, Rhinol
and Laryngol, 1936, 45, 339-350 

Lurie, M H, Davis, H, and Derbyshire, A J The electrical activity of the 
cochlea in certain pathological conditions Ann Otol, Rhinol and 
Laryngol , 1934, 43, 321-344 

Lüscher, E. Die Funktion des Musculus stapedius beim Menschen Zsch
Hals-Nasen u Ohrenheilkunde, 1929, 23, 105-132 
Marnay, A, and Nachmansohn, D Choline esterase in voluntary muscle
J Physiol, 1938, 92, 37-47 

Marshall, R. N A non-directional microphone. Bell Lab Rec, 1935, 14,
34-38 

Marshall, W. H , Woolsey, C N, and Bard, P Representation of tactile 
sensibility in the monkey's cortex as indicated by cortical potentials 
Amer J Physiol, 1937, 119,372-373 

Maxfield, J P. Some physical factors affecting the illusion in sound motion
pictures J Acous Soc. Amer, 1931, 3, 69-80 
McLachlan, N W. Noise. London Oxford Univ Press, 1935
Metfessel, M The vibrato in artistic voices Univ Iowa Stud, 1932, 1,
14-117. 

Meyer, M F. The hydraulic principles governing the function of the cochlea 
J Gen Psychol, 1928, 1, 239-265





Miles, W R Accuracy of the voice in simple pitch singing Psychol Rev
Monog, 1914, 16, No 69, 13-66 

Miller, D C (1) The science of musical sounds New York Macmillan, 
1926 

(2) Anecdotal history of the science of sound New York Macmillan, 
1935 

Montgomery, H. C (1) Do our ears grow old? Bell Lab Rec, 1932, 10,
311-313 

(2) Influence of experimental technique on the measurement of differential 
intensity sensitivity of the ear J Acous Soc Amer, 1935, 7, 39-43

Moul, E R An experimental study of visual and auditory thickness
Amer J Psychol, 1930, 42, 544-560 

Newman, E B The validity of the just noticeable difference as a unit of 
psychological magnitude Trans Kans Acad Sci , 1933, 36, 172-175 

Newman, E B, Stevens, S S, and Davis, H Factors in the production of
aural harmonics and combination tones J Acous Soc Amer, 1937, 9, 
107-118 

Newman, E. B , Volkmann, J , and Stevens, S S On the method of bisection 
and its relation to a loudness scale. Amer J Psychol , 1937, 49, 134-137 

Olson, H F, and Massa, F (1) Applied acoustics Philadelphia Blakiston's,
1934 

(2) Performance of telephone receivers as affected by the ear J Acous 
Soc Amer, 1935, 6, 250-254 

Osterhout, W J V , and Hill, S E Electrical variations due to mechanical 
transmission of stimuli J Gen Physiol, 1931, 14,473-485 

Parker, G H Humoral agents in nervous activity with special reference to 
chromatophores Cambridge, England Univ Press, 1932.

Parker, G H, and Paine, V L. Progressive nerve degeneration and its rate
in the lateral line nerve of the catfish Amer J Anat, 1934, 54, 1-25 

Pattie, F A , Jr An experimental study of fatigue in the auditory mechanism 
Amer J Psychol, 1927, 38, 39-58

Pearce, C H The pitch specificity of auditory adaptation J Gen Psychol, 
1935, 12,358-371 

Pfeifer, R. A Die Lokalisation der Tonskala innerhalb der kortikalen
Hörsphäre des Menschen Mschr Psychiat u Neurol, 1921, 1, 99

Physical Society Discussion on audition Cambridge, England Univ
Press, 1931 

Pierce, A H Studies in space perception New York Longmans, Green, 
1901 

Pierce, G W Magnetostriction oscillators Proc. Instit Radio Eng, 1929, 17,
42-88

Piéron, H Revue générale d'acoustique psychophysiologique. Ann de
psychol, 1934, 1, 167-197 

Piersol, G A (Editor) Human anatomy, including structure and development
and practical considerations Philadelphia Lippincott, 1907 (6th
ed, 1918)

Pohlman, A G (1) Acoustic insulation and the cancellation effect at the 
basilar membrane J Acous Soc Amer, 1931, 3, 269-274 





(2) Is the Weber interpretation of auditory mechanics correct? Amer J 
Physiol, 1935, 113, 106-107 

(3) The present status of the mechanics of sound conduction in its relation 
to the possible correction of conduction deafness J Acous Soc Amer,
1936, 8, 112-117

Pol, B van der (1) A new transformation in alternating-current theory 
with an application to the theory of audition Proc Instit. Radio Eng , 
1930, 18, 221-230 

(2) Frequency modulation Proc. Instit. Radio Eng , 1930, 18, 1194-1205 
Poljak, S (1) The connections of the acoustic nerve J Anat, 1926, 60,
465-469 

(2) The main afferent fiber systems of the cerebral cortex in primates
Univ Calif Publ Anat, 1932, 2

Polvogt, L. M Histologic variations in the middle and inner ears of patients
with normal hearing Arch Otolaryngol, 1936, 23, 48-56 
Pratt, C C (1) Bisection of tonal intervals longer than an octave. J Exper
Psychol, 1928, 11, 17-26

(2) Comparison of tonal distances J Exper Psychol, 1928, 11, 77-87

(3) The spatial character of high and low tones J Exper Psychol , 1930, 
13, 278-285 

Pumphrey, R. J, and Rawdon-Smith, A F Hearing in insects the nature of
the response of certain receptors to auditory stimuli Proc Roy Soc.
London, 1936, 121B, 18-27

Ramsdell, D A The psycho-physics of frequency modulation Unpublished 
thesis, Harvard University 

Rawdon-Smith, A F (1) Auditory fatigue Brit J Psychol, 1934, 25, 77-85

(2) Experimental deafness Further data upon the phenomenon of so- 
called auditory fatigue Brit J Psychol , 1936, 26, 233-244 
Rawdon-Smith, A F, and Grindley, G C An illusion in the perception of 
loudness Brit. J Psychol, 1935, 26, 191-195
Rayleigh, Lord An instrument capable of measuring the intensity of aerial
vibrations Phil. Mag, 1882, 14, 186-187
Reboul, J A Le phénomène de Wever et Bray Montpellier Imprimerie
de la Charité, 1937.

Reboul, J A Mécanique de l'oreille interne J de physique, 1938, 9, Ser 7
Retzius, G Das Gehörorgan der Wirbelthiere. Morphologisch-histologische
Studien Vol I Gehörorgan der Fische und Amphibien Vol II
Das Gehörorgan der Reptilien, der Vögel, und der Säugetiere Stockholm
Samson and Wallin, 1881-1884

Rich, G J (1) A preliminary study of tonal volume J Exper Psychol, 
1916, 1, 13-22 

(2) A study of tonal attributes Amer J Psychol , 1919, 30, 121-164 
Richardson, L. F, and Ross, J S Loudness and telephone current. J Gen 
Psychol, 1930, 3, 288-306 

Riesz, R. R. (1) Differential intensity sensitivity of the ear for pure tones 
Phys Rev, 1928, 31, 867-875 

(2) The relationship between loudness and the minimum perceptible 
increment of intensity J Acous Soc Amer, 1933, 4, 211-216. 





Roaf, H E. The analysis of sound waves by the cochlea Phil Mag, 1922, 
43, 349-354

Rosenblueth, A, and Ortiz, T The crossed respiratory impulses to the
phrenic Amer J Physiol, 1936, 117, 495-513
Sabine, W C. Collected papers on acoustics Cambridge Harvard Univ
Press, 1922 (See esp appendix pp 277-279 ) 

Saul, L J, and Davis, H Action currents in the central nervous system 
I Action currents of the auditory tracts Arch Neurol and Psychiat, 
1932,28, 1104-1116 

Seashore, C. E (Editor) The vibrato Univ Iowa Stud Psychol Music,
1932, 1 

Shower, E G, and Biddulph, R. Differential pitch sensitivity of the ear
J Acous Soc. Amer , 1931, 3, 275-287 

Sivian, L J A modification of the Rayleigh disk method for measuring 
sound intensities Phil Mag, 1928, 5, 615-620
Sivian, L J, and White, S D On minimum audible sound fields J Acous
Soc Amer, 1933, 4, 288-321

Snow, W B Change of pitch with loudness at low frequencies J Acous 
Soc Amer, 1936,8,14-19 

Steinberg, J C Positions of stimulation in the cochlea by pure tones J
Acous Soc Amer , 1937, 8, 176-180 

Steinberg, J C , and Gardner, M B The dependence of hearing impairment 
on sound intensity J Acous Soc Amer, 1937, 9, 11-23 
Steinberg, J C, and Munson, W A Deviations in the loudness judgments 
of 100 people. J Acous Soc. Amer, 1936, 8, 71-80 
Steinberg, J C, and Snow, W B Physical factors in auditory perspective.
Bell System Tech J, 1934, 13, 245-259
Steudel, U Über Empfindung und Messung der Lautstärke. Hochfrequenztechnik
und Elektroakustik, 1933, 41, 116-123
Stevens, S S (1) The attributes of tones Proc Nat Acad Sci , 1934, 20, 
457-459 

(2) The volume and intensity of tones Amer J Psychol, 1934, 46, 397- 
408 

(3) Tonal density J Exper Psychol, 1934, 17, 585-592

(4) Are tones spatial? Amer J Psychol, 1934, 46, 145-147

(5) The relation of pitch to intensity J Acous Soc Amer, 1935, 6,
150-154 

(6) The operational definition of psychological concepts Psychol Rev, 
1935, 42, 517-527 

(7) A scale for the measurement of a psychological magnitude loudness 
Psychol Rev, 1936,43,405-416 

(8) On hearing by electrical stimulation J Acous Soc Amer, 1937, 8, 
191-195 

Stevens, S S, and Davis, H Psychophysiological acoustics pitch and loudness
J Acous Soc Amer, 1936, 8, 1-13
Stevens, S S, Davis, H, and Lurie, M H The localization of pitch perception
on the basilar membrane. J Gen Psychol, 1935, 13, 297-315





Stevens, S S, and Gerbrands, R. A twin-oscillator for auditory research
Amer J Psychol, 1937, 49, 113-115

Stevens, S S , and Hunt, F V The sensitivity of the ear to high frequency 
modulated electric currents (Unpublished, 1937 ) 

Stevens, S S, and Newman E B (1) The localization of pure tones 
Proc. Nat Acad Sci, 1934, 20, 593-596

(2) The localization of actual sources of sound Amer J Psychol, 1936, 48,
297-306 

(3) On the nature of aural harmonics Proc Nat Acad Sci , 1936, 22, 
668-672 

Stevens, S S, and Sobel, R. The central differentiation of synchronized 
action potentials in the auditory nerves Amer J Physiol, 1937, 119,
409-410 

Stevens, S S , and Volkmann, J The effect of attitude on the bisection of 
intervals of loudness (Unpublished, 1937 ) 

Stevens, S S, Volkmann, J, and Newman, E B A scale for the measurement
of the psychological magnitude pitch J Acous Soc Amer, 1937,
8, 185-190

Stewart, G W (1) Binaural beats Phys Rev, 1917, 9, 502-528

(2) The function of intensity and phase in the binaural location of pure 
tones Phys Rev, 1920, 2nd Ser , 15, 425-445 

(3) The intensity logarithmic law and the difference of phase effect in
binaural audition Psychol Monog, 1922, 31, No 1, 30-44 

(4) Problems suggested by an uncertainty principle in acoustics J Acous 
Soc Amer, 1931, 3, 325-330

(5) Introductory acoustics New York Van Nostrand, 1933 

Stowell, E Z, and Deming, A F Aural rectification J Acous Soc Amer,
1934, 6, 70-79

Strecker, F Die Bemerkbarkeit von Einschwingzeiten Telegr- u Fernspr-
Techn, 1935, 24, 1-5

Stuhlman, O, Jr The nonlinear transmission characteristics of the auditory
ossicles J Acous Soc Amer, 1937, 9, 119-128

Stumpf, C. Tonpsychologie Vol I Leipzig Hirzel, 1883 (See esp p
250)

Tiffin, J , and Seashore, H Summary of the established facts in experimental 
studies on the vibrato up to 1932 Univ Iowa Stud, 1932, 1, 344-376

Titchener, E B (1) Experimental psychology Vol II, Pt ii, pp 232-248,
1905

(2) A textbook of psychology New York Macmillan, 1910 (See esp
p 218)

Trimble, O C (1) The theory of sound localization a restatement Psychol
Rev, 1928, 35, 515-523

(2) A discrete impulse technique in sound localization Brit J Psychol, 
1928, 19, 167-178 

(3) Some temporal aspects of sound localization Psychol Monog, 1928,
28, No 4, 172-225

(4) The relative roles of the temporal and the intensive factors in sound 
localization Amer J Psychol, 1929, 41, 564-576 





(5) Intensity-difference and phase-difference as conditions of stimulation in 
binaural sound localization. Amer J Psychol, 1935, 47, 264-274

Trimmer, J D, and Firestone, F A An investigation of subjective tones by
means of the steady tone phase effect J Acous Soc Amer, 1937, 9,
23-29

Tröger, J Die Schallaufnahme durch das äussere Ohr Physik Zsch, 1930,
31, 26-47

Troland, L. T (1) The psychophysiology of auditory qualities and attributes 
J Gen Psychol , 1929, 2, 28-58 

(2) Psychophysiological considerations relating to the theory of hearing 
J Acous Soc, 1929-30, 1, 301-310 

Tullio, P Some experiments and considerations on experimental otology
and phonetics Bologna Licinio Cappelli, 1929

Upton, M Differential sensitivity in sound localization Proc Nat. Acad 
Sci, 1936, 22, 409-412 

Upton, M, and Crozier W J On auditory intensity discrimination Proc. 
Nat Acad Sci, 1936, 22, 417-420

Upton, M, and Holway, A H On the psychophysics of hearing I Monaural
differential sensitivity and exposure time II Binaural differential
sensitivity and exposure time Proc Nat Acad Sci, 1937, 23,
29-34

Valentine, W L. Note on the "binaural beat" J Comp Psychol, 1927, 7,
357-368 

Vance, T F Variation in pitch discrimination within the tonal range 
Psychol Monog, 1914, 16, 115-149

Walker, A E (1) The projection of the medial geniculate body to the
cerebral cortex in the macaque monkey J Anat, 1937, 71, 319-331
(2) The thalamus of the rhesus monkey (Macaca mulatta) Chicago
Univ Chicago Press (in press 1937)

Watson, F R. Sound New York Wiley, 1935

Watson, N A Hearing by bone conduction J Acous Soc Amer, 1937, 
99-106 

Wedell, C H The nature of the absolute judgment of pitch J Exper 
Psychol, 1934, 17,485-503 

Wegel, R L. (1) A study of tinnitus Arch Otolaryngol, 1931, 14, 158-165
(2) Physical data and physiology of excitation of the auditory nerve
Ann. Otol, Rhinol and Laryngol, 1932, 41, 740-779

Wegel, R. L, and Lane, C E The auditory masking of one pure tone by
another and its probable relation to the dynamics of the inner ear 
Phys Rev , 1924, 23, 266-285 

Wegel, R. L, Riesz, R. R, and Blackman, R B Low frequency thresholds of
hearing and of feeling in the ear and ear mechanism J Acous Soc
Amer, 1932, 4, 6

Weinberg, M, and Allen, F On the critical frequency of pulsation tones
Phil Mag , 1924, 47, 50-62 

Westphal, H Unmittelbare Bestimmungen der Urfarben Zsch f Sinnesphysiol,
1910, 44, 182-230

Wever, E G (1) Beats and related phenomena resulting from the simultaneous
sounding of two tones. Psychol Rev, 1929, 36, 402-523





(2) Impulses from the acoustic nerve of the guinea pig, rabbit, and rat
Amer J Psychol , 1931, 43, 457-462 

(3) The physiology of hearing the nature of response in the cochlea 
Physiol Rev, 1933, 13, 400-425

(4) A study of hearing in the sulfur winged grasshopper J Comp 
Psychol, 1935,20, 17-20 

Wever, E G, and Bray, C W (1) Action currents in the auditory nerve
in response to acoustical stimulation Proc Nat Acad Sci, 1930, 16,
344-350

(2) The nature of acoustical response The relation between the sound 
frequency and frequency of impulses in the auditory nerve. J Exper
Psychol , 1930, 13, 373-387 

(3) Present possibilities for auditory theory Psychol Rev, 1930, 37, 
365-380 

(4) Auditory nerve responses in the reptile. Acta Oto-Laryngol, 1931, 16,
154-159 

(5) A new method for the study of hearing in insects J Cell and Comp 
Physiol, 1933,4,79-93 

(6) The nature of acoustic response. The relation between sound intensity 
and the magnitude of responses in the cochlea J Exper Psychol,
1936, 19, 129-143 

(7) The nature of bone conduction as shown in the electrical responses of 
the cochlea Ann Otol, Rhinol and Laryngol, 1936, 45, 822-831

(8) Hearing in the pigeon as studied by the electrical responses of the 
inner ear J Comp Psychol, 1936, 22, 353-363 

(9) The perception of low tones and the resonance volley theory J
Psychol, 1936, 3, 101-114

Wever, E G, Bray, C. W, and Horton, G P Localization in the cochlea 
as studied by the stimulation deafness method Ann Otol, Rhinol 
and Laryngol , 1935, 44, 772-777 

Wever, E. G, Bray, C W, and Willey, C F The response of the cochlea
to tones of low frequency J Exper Psychol, 1937, 20, 336-349

Wever, E. G, and Truman, S R. The course of the auditory threshold in the 
presence of a tonal background J Exper Psychol, 1928, 11, 98-112 

Wiggers, C. J Physiology in health and disease Philadelphia Lea and
Febiger, 1937

Wiggers, H C The functions of the intra-aural muscles Amer J Physiol,
1937, 120, 781-797

Wightman, E R, and Firestone, F A Binaural localization of pure tones
J Acous Soc Amer, 1930, 2, 271-280 

Wilkinson, G, and Gray, A A Mechanism of the cochlea London Macmillan,
1924

Wilska, A Eine Methode zur Bestimmung der Hörschwellenamplituden des
Trommelfells bei verschiedenen Frequenzen Skand. Arch f Physiol,
1935, 72, 161-165

Wilson, H A , and Meyers, C S The influence of binaural phase differences 
on the localization of sounds Brit. J Psychol, 1908, 2, 363-385





Wingfield, R. C An experimental study of the apparent persistence of auditory
sensations J Gen Psychol, 1936, 14, 136-157

Wittmaack, K. Über Schädigung des Gehörs durch Schalleinwirkung
Zsch f Ohrenhk , 1907, 54, 37-80 

Wolff, W Versuche zur Lautstärkeempfindung Zsch f Psychol, 1935,
136, 325-340

Wrightson, T, and Keith, A An inquiry into the analytical mechanism 
of the internal ear London, 1918

Youtz, R. E P, and Stevens, S S On the pitch of frequency modulated
tones Amer J Psychol , 1938, 50 (in press) 

Zoll, P M The relation of tonal volume, intensity, and pitch Amer J
Psychol, 1934, 46, 99-106 

Zurmühl, G Abhängigkeit der Tonhöhenempfindung von der Lautstärke
und ihre Beziehung zur Helmholtzschen Resonanztheorie des Hörens
Zsch f Sinnesphysiol, 1930, 61, 40-86



INDEX OF NAMES 


Abraham, 165, 457 
Ades, 431, 457 
Adrian, 311, 312, 457 
Allen, 231, 470 

American Otological Society, 292, 457 

Andrade, 27, 457 

Andreef, 353, 457 

Angell, 180, 457

Ashcroft, 339, 415, 457 

Bachem, 108, 457
Banister, 218, 457 
Bard, 434, 465 
Barnes, 58, 457 
Barrera, 339, 462 
Barron, 305, 457 
Barton, 70, 457
Bast, 324, 335, 336, 457
Beatty, 457 

Békésy, 44-46, 53, 58-60, 102, 103, 115,
116, 120, 126, 146, 147, 154, 163,
173, 186, 196, 212, 213, 219-223,
238, 245, 246, 250-257, 262, 263,
279-281, 286, 293, 294, 332, 357,
373, 457, 458, 459
Bell, 359 

Bell Telephone Laboratories, 89, 99, 
140 

Bentley, 462 

Bernstone, xii

Bezold-Brücke, 73

Biddulph, 86-92, 152, 367-369, 468 

Black, 323, 324, 460 

Blackman, 44, 470 

Boring, vii, xii, 2, 95, 98, 147, 160, 164,
459, 461 
Bowen, 273, 459 
Boyle, v 

Bray, vii, 46, 252, 295, 310, 315, 320,
395, 403, 471
Brecher, 46, 459
Broemser, 262


Brogden, 429, 459 
Bronk, 311, 457 
Bronstein, 218, 220, 459 
Bunch, 67, 68, 459 
Burck, 101-105, 157, 159, 459 
Buytendijk, 310, 460

Cannon, 302, 391, 460 
Chapin, 204, 460 
Churcher, 113, 114, 125, 142, 460 
Churilova, 218, 459
Ciocco, 363, 460 

Coppee, 315, 383, 387, 421-425, 464 
Cotugno, 359 
Covell, 323, 324, 460
Cowan, 76, 465 

Crowe, 254, 255, 339, 363-365, 370, 
460, 464 
Crozier, 470 

Culler, 361, 370-372, 429, 431, 457, 
459, 460, 461 
Czerny, 58, 457 

Dahmann, 255, 257, 263, 460 
Datta, 21, 460 
Davies, 113, 114, 142, 460 
Davis, A H, 15, 84, 445, 460
Davis, H, xii, 97, 152, 190, 197-202,
207, 262-266, 311, 314, 318-321,
325-329, 335, 346, 357-362, 366-
370, 382-385, 390, 395, 402-405,
430, 460, 461, 466, 468
Dean, 461
Deming, 233, 469

Derbyshire, 262, 328, 335, 361, 382- 
385, 390, 396, 402-405, 410, 460, 
461 

Donder, vi 
Drysdale, 31, 461 

Einthoven, 299, 461
Ekdahl, 98, 101, 461




Erlanger, 387, 408, 461 
Ewing, 461 

Eyster, 324, 335, 336, 457 

Fay, 7, 461 
Fechner, 63, 148 
Finch, 361, 461 

Firestone, 34, 82, 113, 114, 172-175, 
204, 205, 460, 461, 462, 470, 471 
Fite, 180, 457

Fletcher, 44, 47, 76, 99, 111-130, 185- 
187, 212, 216, 274, 284, 285, 461 
Forbes, xi, 310, 410, 461 
de Forest, vi 
Fourier, v, 19 
Frank, 262 
Frederick, 29, 461 
Fry, 174, 463 

Gage, 121, 462 
Galileo, v 
Galt, 462 

Gardner, 132, 135, 468 
Gasser, 387, 408, 461 
Geffcken, 261, 263, 462
Geiger, 82, 113, 114, 462
Gerbrands, 469
Gersuni, 67, 353, 457, 462
Girden, 429, 459 
Gray, A , 442, 462 
Gray, A A, 372, 471 
Grindley, 142, 467

Guild, 275, 295, 335, 357, 358, 363-365, 
460, 462, 463 
Guilford, 50, 84, 145, 462 
Guttman, 339, 462
Gundlach, 462 

Hall, 20, 39, 462 

Hallpike, 265, 273, 322, 334, 339, 340, 
394, 412, 415, 457, 462, 463
Halverson, 161, 177, 178, 463
Ham, 113, 114, 462, 463
Hardy, 415, 463
Harris, 253, 465
Hartley, 174, 463 

Hartridge, 340, 394, 411, 412, 462, 463
Hartshorn, 411, 463 


Harvard Laboratory, 164 
Helmholtz, v-vii, 1, 9, 20, 244, 257,
263, 297, 359, 463
Henney, 24, 463 
Hermann, vi 
Hill, A V, 408, 463 
Hill, S E, 298, 466
Holway, 52, 142, 463, 470
Hollinshead, 237, 463
Horton, 463, 471 
Howe, 335, 463 
Hughes, 52, 464 
Hughson, 254, 327, 464 
Hunt, xii, 24, 355, 464, 469

Janovsky, 464 
Jasper, 433, 464 
Jones, 293, 464 

Keith, 274, 472 

Kemp, 315, 361, 383, 387, 421-428,
460, 464 

King, 113, 114, 125, 142, 460 

Knauss, 119, 464 

Knudsen, 85, 136, 292, 464 

Kobrak, 262, 464 

Kock, 464 

Koenig, 1, 464

Kornmüller, 434, 464

Kotowski, 101-106, 157, 159, 459

Krainz, 255, 464

Kranz, 44 

Kucharski, 232, 464 

Lane, 185, 208-217, 443, 464, 470 
Lapicque, 305, 465 
Larsen, 205, 206, 465 
Leighton, xii

Lewis, 76, 196, 205, 206, 465

Lichte, 101-106, 157, 159, 287, 459, 465

Lifshitz, 154, 465 

Littler, 461 

Loewi, 391 

Lorente de Nó, 253, 264, 275, 290, 322,
351, 377, 378, 388, 390, 415, 419,
424, 465
Luft, vi, 84 





Lurie, xii, 97, 262, 266, 270, 272, 290,
314, 328, 335, 336, 357-370, 460, 
461, 468 

Lüscher, 264, 465

Mach, 373 
MacRobert, 442, 462 
Marnay, 392, 465 
Marshall, R N, 33, 465 
Marshall, W H, 434, 465 
Massa, 27, 466 
Matthews, B H C, 305, 457
Matthews, G B , 442, 462 
Maxfield, 183, 465 
McLachlan, 465 
Metfessel, 236, 465 
Mettler, 429, 431, 457, 459, 460
Meyer, 44, 49, 84, 465 
Meyers, 471 
Miles, 70, 466 
Miller, D C , xi, 2, 19, 466 
Miller, R H, 310, 461
Minton, 44 

Montgomery, 68, 142-144, 466 
Moul, 466 
Müller, vi

Munson, 44, 47, 111-116, 123-130, 461,
468

Nachmansohn, 392, 465 
Newman, 79-83, 96, 121, 149, 176, 
188-202, 207, 319, 466, 469 

O'Connor, xi, 310, 461
Ohm, v, 20, 99
Olson, 27, 466
O'Neill, xii
Ortiz, 305, 306, 468
Osterhout, 298, 466

Paine, 305, 466 
Parker, 305, 392, 466 
Parkinson, 113, 114, 463 
Patue, 466 
Pavlov, 219 
Pearce, 466 
Pfeifer, 434, 466 
Phillips, 311, 457 


Physical Society, 466 
Pierce, A H, 176, 466
Pierce, G W, 199, 466
Pieron, 466
Piersol, 250, 466
Pohlman, 254, 466 
van der Pol, 92, 442, 467 
Poljak, 377, 416, 467 
Polvogt, 291, 363-365, 460, 467 
Pratt, 69, 82, 467
Preyer, vi, 84
Pumphrey, 399, 467 
Pythagoras, v 

Ramsdell, 236, 439, 467 
Rasmussen, 416 

Rawdon-Smith, xii, 142, 219, 265, 334,
339, 340, 394, 399, 412, 457, 462, 
463, 467 

Rayleigh, 1, 32, 48, 467
Reboul, 286, 443, 444, 467
Reger, 196, 465 
Retzius, 259, 467 
Rich, 161, 467 
Richardson, 113, 467 
Riesz, 44, 137-143, 148, 152, 171, 232,
244, 467, 470 
Roaf, 277, 468 

Robinson, 315, 383, 387, 421-428, 464 
Rosenblueth, 302, 305, 306, 391, 460, 
468 

Ross, 113, 467 

Sabine, W C, 17, 468
Saul, 262, 311, 328, 335, 461, 468
Savart, v 
Schaefer, 84 
Seashore, 234, 239, 468 
Sherrington, 404 
Shower, 86-92, 152, 367-369, 468 
Sivian, 43-52, 57, 169, 177, 468
Snow, 74, 75, 168-170, 181-183, 468
Sobel, 116, 172, 178, 469
Steinberg, 75, 126, 132, 135, 168-170, 
181-183, 468 
Steudel, 155-159, 468 
Stevens, xii, 41, 65-83, 96, 101, 116-
122, 149-152, 161-165, 172, 176-





178, 188-202, 207, 240, 266, 314- 
326, 354, 358, 366-374, 395, 461, 
466, 468, 469, 472 
Stewart, 38, 104, 171, 469
Stowell, 233, 469
Strecker, 105, 469
Stucker, 84 

Stuhlman, 196, 250, 257, 258, 469
Stumpf, vi, 113, 469

Tartini, v 
Tiffin, 239, 469 
Titchener, 82, 149, 469
Trimble, 172, 427, 469 
Trimmer, 205, 470 
Troger, 260, 470 
Troland, 165, 403, 404, 470 
Truman, 208, 471 
Tullio, 357, 470

Upton, 52, 142, 170, 361, 460, 463, 470 

Valentine, 172, 470
Vance, 84, 470 

Volkmann, 79-83, 96, 121, 122, 466, 
469 

Volokhov, 67, 353, 457, 462 

Walker, 416, 418, 470 
Watson, F R, 470
Watson, N A, 470 


Weber, 63, 136 
Wedell, 107, 470

Wegel, 44, 58-60, 124, 185, 208-211,
214, 217, 323, 443, 461, 470
Weinberg, 231, 470 
Westphal, 108, 470 

Wever, vii, 46, 208, 242-245, 252, 295, 
310, 311, 315, 320, 321, 395, 403,
470, 471 

White, 43, 47-52, 57, 169, 177, 463 

Wien, 48, 261 

Wiggers, C J, 232, 471

Wiggers, H C , 266, 267, 315, 471 

Wightman, 172-175, 471 

Wilkinson, 372, 471 

Willey, 315, 471 

Willman, 460 

Wilska, 55, 56, 293, 471

Wilson, H A, 471 

Wilson, J G, 44 

Wingfield, 232, 472 

Witting, 327, 464

Wittmaack, 360, 472 

Wolff, 121, 472

Woolsey, 434, 465

Wrightson, 274, 472 

Youtz, 240, 374, 472 

Zoll, 161, 472 
Zurmuhl, 70, 472



SUBJECT INDEX 


Absolute judgment, 82 
Absolute pitch, 107-109 
Acetylcholine, 302, 391-392 
Action-current, 449 
Action potentials, 300-307, 449 
contrasted with microphonic, 311- 
313 

effect on wave-form of microphonic, 
316-317

combination with microphonics, 
350-351 

in auditory nerve, 379-389 
characteristics of, 380-385 
synchronization of, 393-404, 421 
relation to intensity, 405-409 
relation to size of fiber, 408 
behavior under change of phase, 
411-413 

in higher centers, 420-429 
effect of binaural stimulation, 428-
429 

Acuity (see Difference limen)
Adrenin, 392

Age, loss of sensitivity with, 67-68 
Air, distortion due to, 7 
All-or-none law, 300-307, 393-404
Alternation in nerve, 399-404 
Amplifier, vacuum tube, 36 
Amplitude 
of sound wave, 5 
of movement of eardrum, 55-56 
of skull in bone-conduction, 293 
of action potentials, 383, 421-424 
Amplitude-distortion, 13 
Amplitude-modulation, 231-233
formulas for, 439 
Amplitude resonance, 12, 445 
Analogue of cochlea, electrical, 443 
Analogy, electro-acoustic, 34 
Analysis of sound, 19 
Analyzer, wave, 20, 39 
Anatomy 


of middle ear, 249-251 
of inner ear, 268-276
of auditory nerve, 376-379 
of higher auditory pathways, 414-
420 

of cochlear nucleus, 418-420 
Angular velocity, 6 
Anvil (see Incus) 

Apparatus, electrical, 35-41 
Aqueduct, cochlear, 269 
Area 

auditory, 58-63, 152-153 
under electrical stimulation, 66 
at cortex, 417-418 
Brodmann’s, 418 
Artery 
basilar, 271 
internal auditory, 271 
pressure wave in, 282 
Articulation of ossicles, 255-259 
Aspects of sensation, 113-114 
(see also Attributes) 

Asymmetry 
of ear, 192-196 
of ossicles, 259 

of cellular potentials, 303-304 
Atrophy, role in deafness, 364 
Attenuation of sound through head, 
215 

Attenuator, 38 

Attributes of tones, 160-166, 448-449 
Audibility (see Threshold) 
Audiogram, 61-63, 450 
masking, 130, 135-136, 453 
showing high tone deafness, 364 
of effect of surgical damage, 366 
Audiometer, 60-63 
for bone-conduction, 64 
Audiometry, 60-61 
in bone-conduction, 64 
Aufgaben, 436

Aural harmonics (see Harmonics)





Aural microphonics (see Micro-
phonics)

Autonomic nervous system, 391 
Azimuth, 169, 450 

Basilar mechanics, relation to DL's, 97 
Basilar membrane (see Membrane) 
Beat frequencies, 37 
Beats, 16, 241-244, 450 
binaural, loudness of, 115-116 
upper limit of, 172, 178 
relation to synchronized impulses,
421 

method of "best beats," 184-187
role in determining DL, 136-141 
optimum rate of, 137 
Bel, 29, 450 

Bessel functions, 441-442 
Bezold-Brücke effect, 73
Binaural beats (see Beats)

Binaural localization (see Localization)

Binaural masking, 214-215 
Binaural stimulation, action potentials 
due to, 428-429 
Binaural vs monaural 
effect on threshold, 52, 430 
effect on DL’s for frequency, 87 
effect on DL's for intensity, 141-142 
effect on localization, 180
Bisection 

of musical intervals, 82 
of loudness, 120-122 
Bone-conduction, 252, 291-295 
effect on pressure measurements, 54 
sensitivity by, 64 
of sound from ear to ear, 215
threshold of, 292-293 
mechanism of, 293-295 
Brain, activity of gray matter, 308-309, 
433-434 

Brain waves, 433-434 
Brightness, tonal, 165-166 
Brodmann's area, 418
Brownian movement, 57 

Calibration 
of audiometers, 60, 64 


of frequency meters, 22-24 
of microphones, 33 
pressure, 33 
field, 33 
Canal 

external auditory, 249 
resonance of, 53 
Canals 

semicircular, 249-250, 268-269 
role in bone-conduction, 293-294
stimulation by loud sounds, 58, 
356-357 
Cathode ray, 40 
Cats, albinotic, 335-337
Cells 

polarization of, 296-298 
orientation of, 304 
(see also Hair-cells and Ganglion 
cells) 

Cerebral cortex 
effect of removal, 429-431 
electrical activity of, 433-434 
Characteristic curve, 14 
of ear, 195
stability of, 206-207 
Chemical mediators, 302 
role in stimulation of auditory 
nerve, 390-392

Chloroform, effect on acoustic reflex,
266

Chroma, tone, 108 
Chronaxie, 305 
Circle of reference, 5 
Click, avoidance of, 224 
Clicks 

response of ear to, 263, 282-284
masking of, 213, 280-281
loudness of, 154-159
microphonics in response to, 327-
332 

action potentials in response to, 3S9- 
388 

masking of potential due to, 409- 
411 

effect at cortex, 434 
Clock, synchronous, 22 
Cochlea, 268-276 
dynamics of, 276-286, 443-445





Cochlea, diagram of, 249, 250 
dimensions of, 274 
distortion of hair-cells, 341-345 
Cochlear nucleus, 418-420 
Cochlear response (see Microphonics)
Combination tones, 184, 197-201, 211,
450 

Complex sounds, pitch of, 98-99 
Complexity of a sound, 6-7 
Components of modulated tones, 225- 
231 

Conditioned reflex 
use of in tests of hearing, 361-362, 
431 

present after decortication, 429-430 
Conduction, in auditory nerve, 380 
Consonance, theory of, 244 
Contours (isophonic), 452
for equal pitch, 71 
for equal loudness, 123-127 
for equal volume, 161-163 
for equal density, 163-165 
Contraction of intra-aural muscles, 
266-267 

Coordinates, relation of logarithmic to 
linear, 319 
Cortex 

auditory, 417-418, 432-433 
effect of removal, 429-431 
electrical activity in, 433-434 
Corti, organ of, 271-276 

degeneration in otosclerosis, 290 
in deficient animals, 334-337
Corti's ganglion, 269
number of cells in, 274-275
Cortical factors in fatigue, 219-220
Cricket, cercal nerve of, 399
Crystal, piezo, 299 
Cues 

for localization, 167 ff 
for absolute pitch, 107-109 
Cycle, definition, 21, 450 

Damped vibration, 9-10 
Damping 
factor, 10 

in ear as a whole, 263, 330-331 


in cochlea, 278, 286-287, 331, 413, 
445 

of basilar membrane, 286-287 
Deafness, 288-295 
due to age, 68 
variable type, 132-133 
nerve, 133 

relation to loudness, 131-136
types of, 288 

in congenitally deficient animals, 
334-337 

stimulation, 337, 360-363 
boilermaker's, 357
detonation, 357 
high tone, 363-364 
Decay time of sensation, 222 
Decibel, 29-31, 450 
tables for, 446-447 
Degeneration 
of auditory nerve, 339-341 
Wallerian, 339

of cochlea from long exposure, 360- 
363 

Deiters' cells, 272

Delay at synapse, 307-309, 424-427 
Demodulation, 233 
under electrical stimulation, 355 
Density, tonal, 163-165, 451 
relation to brightness, 166 
Dichotic stimulation, 451 
Difference limen, 451 
for frequency, 84-89 
subjective size of, 95 
relation to basilar mechanics, 97,
367-370

for intensity, 136-147 
subjective size of, 148-152 
relation to localization, 170-171 
relation to interrupted tones, 232 
relation to beats, 244 
Difference-tones, 197 
phase of, 205-206 
relation to beats, 244 
Differential sensitivity (see Difference-
limen)

Differentiation underlying discrimination,
435-436





Diffraction of sound waves by head, 53

Dimensions 
of sound, 3, 25
physiological, 306-307 
of sensation, 160-166 
of cochlea, 274 
Discrimination, theories of 
for intensity, 143-147, 151 
for frequency, 372-375 
(see also Difference limen and
Sensitivity) 

Disinhibition, 219
Displacement 
of particles, 26-27 
of eardrum at threshold, 55-56 
Dissonance, theory of, 244 
Distance, cues to, 173-175
Distortion 
definition, 7 

under forced vibration, 13 
amplitude, 13
frequency, 15 
phase, 15 
of sound field, 33 
threshold for, 200-203 
under electrical stimulation, 353-355 
Distortion potential, 298-300, 449, 451 
in cochlea, 312-351 
mechanism in hair-cell, 342 
Dogs, albinotic, 335 
Drugs, effect in cochlea, 337, 363 
Drum (see Membrane, tympanic) 
Ductus cochlearis, 271-274 
Duration 

effect on pitch, 100-105 
effect on loudness, 154 
effect on DL for intensity, 142 
of action potential, 380 
Dynamics of cochlea, 276-286, 443-
445 
Dyne, 451 

Ear 

anatomy of inner, 268-276
anatomy of middle, 249-251
dynamics of inner, 276-278, 443-445
infection of middle, 289-291 


Eardrum, 249 (see Membrane, tympanic)

Eddies in cochlea, 286 
Electrical stimulation of ear (see Elec-
trophonic effect)

Electrodes 
types, 379 

effect on size of action potential,
406, 422-424 
Electrons 

behavior in vacuum tube, 36
behavior in cathode ray oscillo-
graph, 40-41

Electrophonic effect, 65-67, 352-355, 
451 

Electrophysiology, principles of, 296- 
309 

Electrotonus, 305 
Endolymph, 270 
Energy, 28 
density, 455 
flux, 455 

conservation of, 344-345 
Equal-loudness contours (see Contours)

Equal pitch contours (see Contours) 
Equi-distances, method of, 120-123
Equilibration, 302, 396-404, 451 
Esterase, 392 
Eustachian tube, 249-250 
effect of yawning on, 264 
effect of swallowing on, 289 
role in deafness, 289-290 
Excitation 

relation to masking, 215-217 
of nerve fiber, 389-392 
Exponential decay, 10 

Facilitation under binaural stimulation,
429
Fatigue, 217-220 
effect on pitch, 219 
as equilibration, 302, 397-398
effect on synapse, 307 
of aural microphonics, 325-327 
Fechner's law, relation to size of DL,
149 





Feeling 

threshold of, 58-60, 124 
relation of threshold to maximal 
cochlear potential, 322-325 
Fibers of auditory nerve 
relation to hair-cells, 274-276 
resistance of, 408 

mechanism of stimulation of, 389- 
392 

Field-calibration, 33
Field, minimum audible, 47-50 
Fifth, subjective size of, 83 
Filter 
acoustic, 38 
electrical, 38 
Flicker, method of, 221 
Fluids, cochlear, 270, 277 
vortices (eddies) in, 286
Footplate of stapes, 249-251 
Forced vibration, 8, 451 
Fourier's theorem, 19, 451 
Fractionation 
method of, 79 
of pitch, 79-80 
of loudness, 112-117 
Frequency 
definition, 21, 451 
measurement of, 21-25 
natural, 453 

Frequency-distortion, 12, 15 
Frequency meter, 21, 24 
Frequency modulation, 234-241 
formulas for, 439-442 
Frequency theory of pitch, 359-360,
403

Fulcrum of stapes, 256 
Fundamental frequency, 452 
effect of elimination of, 99 
Fusion frequency, 46 
Fusion of interrupted tones, 231-233 

Ganglion, Corti’s (spiral), 269 
number of cells in, 274-275 
Ganglion cells, number of, 274 
Ganglion ventrale, 378
Geniculate (see Medial geniculate 
body) 

Geometry, electrical, of cochlea, 347


Golgi, method of, 378 
Gradient 

role in discrimination, 372-375
pressure, in cochlea, 286, 443-445
extrapolar, 305
Guinea pig, waltzing, 335-337 

Hair-cells 
internal, 272-276 
external, 272-276 
number of, 274 

role in aural microphonics, 333-345 
electrical connections among, 347- 
348 

electrical stimulation of, 352-355 
vulnerability of, 358 
relation to frequency-discrimination,
369-370
Hair-cell theory, 333-345 
Hallucinations, 432-433 
Hammer (see Malleus) 

Harmonic, 7, 452 

Harmonics, aural, 184-207, 210, 452
effect on tonal threshold, 46, 69
odd vs even, 190-193 
phase of, 203-205 
effect on waveform of micro- 
phonics, 316-317 
due to electrical stimulation, 354 
Hearing loss, 62, 452 
calculation of, 134-136 
due to loss of ossicles, 252-254 
Helicotrema, 269 ff
action of, 276 
Hensen’s cells, 273 
Hooke's law, 5, 7, 9, 13
applied to ear, 193, 354 
Horsley-Clark instrument, 431 
Hydrogen molecule, diameter of, 56 
Hypothesis, membrane, 297-302 
Hysteresis, 325-327, 398 

Impedance 
acoustic, 35, 449 
of ear, 259-262 
of air, 261 

electrical, of nerve fiber, 408 
of human body, 65 





Impulses (see Clicks or Nerve-impulse)

Incus, 249-251 
action of, 257-259
Inductance, addition of, 78 
Infection of middle ear, 289-291 
Inferior colliculus, 415 
synchronization at, 421 
Inhibition 

due to tonal stimulation, 219 
of inhibition, 219-220 
at synapse, 303 
Injury potential, 297-298 
Inner ear 

anatomy of, 268-276 
dynamics of, 276-278, 443-445
in bone-conduction, 293-295
Innervation of organ of Corti, 275-276
relation to frequency-discrimination, 
369-370 

Insects, hearing in, 311, 399 
Integration of DL’s 
for frequency, 94-98, 367-370 
for intensity, 147-152 
Intensity 

definition, 25-29, 110, 452 
measurement of, 31-34 
role in localization, 168-171 
Intensity level, 110 
Intensity theory, 168 
Interaction under binaural stimulation,
429

Interference, 16-18, 32, 452
Interrupted tones, 231-233 
Intertone, 243 
Intervals, size of musical, 81 
Ions, 301
Islands, tonal, 63

Just noticeable difference (see Difference-limen)

Labyrinth, 268, 270
role in bone-conduction, 293-294
stimulation of by loud sounds, 356- 
357 

Lacunae, tonal, 63
Lamina, osseous spiral, 269 


Latency 

of nerve impulse, 280
at synapse, 307-309 
of action potential in auditory
nerve, 311, 316, 328, 381, 385 
of aural microphonics, 331, 332
effect of intensity on, 385 
of action potentials in higher pathways,
424-427

relation to binaural localization, 427 
Lateral lemniscus, 415 
action potentials in, 422-424 
crossed fibers in, 428-429 
Ligament 

annular (holding stapes), 256 
spiral, 271 
Limits 

of absolute sensitivity, 56-58 
of sensitivity to frequency, 69-70 
of beats, 242-243 

of frequency in aural microphonics,
315-316 

of synchronization of impulses, 394- 
395 

of synchronization in higher centers,
421, 426
Line-busy effect, 410
Linearity of aural microphonics, 194-
196, 319-322, 348-351
Lissajous figures, 23 
Listening, standard manner of, 47, 111
Localization
auditory, 167-183
high-low, 69

effect of intensity, 168-171
effect of phase, 171-172
effect of time, 173
threshold of, 170-171
of actual sources, 175-180
with only one ear, 180
effect of movement, 180
effect of reverberation, 183 
stereophonic effect, 181-183 
of masked clicks, 281 
of frequency reception on basilar 
membrane, 356-375 
relation to latency of nerve impulses,
427





Loudness, 110-159
scale of, 112, 118 
fractionation of, 112-117 
of binaural beats, 116
equation for, 119 
relation to masking, 127-130 
of multi-component tones, 130-131 
relation to threshold, 131-136 
patterns, 135 
of impulses, 154-159 
relation to auditory nerve impulses, 
407 

due to phase-change, 411-413 
Loudness-function, 117-120 
Loudness-level, 111, 123-125, 453 

Macacus, 417 
Malleus, 249-251 
action of, 257-259 

Map of basilar membrane, 365, 367- 
372 

Masking, 208-217, 453 
by physiological noise, 53-54 
relation to loudness, 127-130 
audiogram, 130, 135-136, 453 
of sound impulses, 213, 281, 410 
binaural, 214-215 
relation to excitation, 215-217
central, 214 

effect of intra-aural muscles on, 267
of nerve impulses, 280, 382, 386-388, 
409-411 

latency of click due to, 281 
Mass 

effect in cochlea, 277 
electrical analogue of, 34 
Maximal stimulation, theory of, 372-
375 

Maximum of cochlear potential, 322- 
325 

of size of action potential, 395-397
Meatus, 53 
Mechanics 
of ear, 248-287 
of ossicles, 255-259 
of distortion of hair-cells, 341-345 
Medial geniculate body, 415 
effect of damage to, 431-432 


projection of organ of Corti in, 431-
432 

Mediator, chemical, 302 

role in stimulation of auditory 
nerve, 390-392 
Medulla 

action potentials in, 310
threshold in, 315 
Mel, 81, 453 
Membrane 
basilar, 272-274 
map of, 367-372 
damping of, 286-287, 413 
relation to round window, 253 
tympanic, 249 
area of, 260 

effect of loss of, 196, 252-254 
thickening of, 290-291 
Reissner's, 271-274
polarization of, 334, 339
tectorial, 273-274
Membrane hypothesis, 297-302 
Microphone, 39 
nondirectional, 33 
Microphonic, 453
Microphonics 
cochlear or aural, 310-355 
threshold of, 313-315 
frequency limits of, 315-316 
wave form of, 316-317 
polarity of, 318 
phase of, 318 

relation to intensity, 319-322
relation to frequency, 322
overload of, 325-327
hysteresis in, 325-327
fatigue in, 325-327
relation to impulsive stimuli 
(clicks), 327-331 

relation to traveling waves, 331-
332 

origin of, 333-345
role in hearing, 346
quantitative relations of, 347-351
as evidence for place theory, 364- 
372 

behavior under change of phase, 
411-413 





Middle ear 
volume of, 249
effect on impedance, 260-262 
anatomy of, 249-251 
infection of, 289-291 
Millisone, 129, 455
Minimum audible field, 46-50 
Minimum audible pressure, 42-46 
Missing fundamental, 99 
Model 

of ossicles, 257-259 
of cochlea, 278-281 
Modiolus, 269 
Modulation, 225-247, 453 
nature of, 225-231 
relation to DL for frequency, 90-93 
relation to measurement of persis-
tency, 221

formulas for, 439-442 
rectangular, 92, 232 
Monaural effects (see Binaural vs 
monaural) 

Movement, effect on localization, 180 
Movement of head due to loud 
sounds, 356-357 
Multivibrator, 22 
Muscle, smooth, 302-304 
Muscles of middle ear, 250-251 
effect on harmonics, 191-193 
activity of, 263 

effect on transmission, 266-267 
contraction visible as microphonic,
315 

effect on microphonics, 322 
Musical intervals, size of, 83 
Musical tones, effect of intensity on, 75-76

Myograph, 263 

Nasopharynx, 249 
Natural frequency, 12 
Natural period, 11 
Nerve 

auditory, 269, 272-276 
degeneration of, 339-341 
electrical stimulation of, 353 
anatomy of, 376-379 
adequate stimulation of, 389-392 


schema for, 402-404 
summation of potentials in, 407- 
409 

phrenic, 306 
cercal, 399

Nerve fiber (see Nerve and Fiber)
Nerve-impulse, 300-309 
relation to sensation, 306-307 
dispersion of, 280, 394 
latency of, 280 
in auditory nerve, 376-413 
in higher pathways, 414-436 
in cochlear nucleus, 420 
in second-order neurons, 420-427 
in third-order neurons, 420-427
Neural unit, 145-147
Neurons, 453 
primary, 415, 419-427 
second-order, 415, 419-427 
third-order, 415, 419-427 
fourth-order, 415-416, 426-427 
intercalary, 419 

Neurophysiology, principles of, 296- 
309 

Nitella, 298
Noise 

physiological, 53 
thermal, 57, 98, 456 
Nonlinearity, 14 
of ear, 192-196 
of ossicles, 259 

of aural microphonics, 348-351
Normal curve, 145 
Normal ear, 60-61 
Nucleus, cochlear, 377, 418-420 
Number of distinguishable tones, 152-
153 

Occlusion under binaural stimulation,
429

Octave, 453 
subjective size of, 83 
Off-effect 
period of, 263 

in aural microphonics, 330-331
Ohm, acoustic, 449
Ohm's acoustical law, 20, 203, 241





On-effect

of aural microphonics, 329-330 
of action potentials, 396-397 
Operating point, 193-196, 453 
Order 

of combination tones, 198
of neurons, 419-420 
Organ of Corti, 271-276 
degeneration in otosclerosis, 290
in deficient animals, 334-337 
Orientation of cells, 304 
Oscillator, 37 
Oscillograph, 40 
Ossicles, 249-251 
distortion due to, 196 
role in transmission of sound, 252- 
254 

mechanics of, 255-259 
importance of, 259 
adhesion of, 291 
effect of disarticulation of, 295 
Otosclerosis, 290 
Overload, 325-327 
Overtone, 7, 454 

Pain, due to loud sounds, 58-60
Partial, 7, 454 
Particle-displacement, 26 
at threshold, 56 
Particle-velocity, 27 
at reference intensity, 30 
Pathology, human, 363-364 
Pathways 
auditory, 416 

effects of damage to, 429-431 
Patterns of stimulation 
due to tone, 216 
due to clicks, 282-284 
shift due to intensity, 72-75, 349-350 
Perilymph, 270
Period, 454 
of sound wave, 6 
natural, 453 

refractory, 301-309, 454, 401-404 
of ear, 262-263 

of aural microphonics, 330-331 
Persistence 

of transient vibrations, 11


of sensation, 220-224 
relation to damping of basilar membrane,
287

Perspective, auditory, 181-183 
Phase 

definition, 6, 454 

sensitivity of ear to, 15-16, 202-206, 
229 

in Lissajous’ figures, 23 
between pressure and velocity, 27
role in localization, 171-172, 178
relation to binaural beats, 178
effect on aural harmonics, 202-205
effect on combination tones, 205-206
role in modulation, 229-231
effect of ossicular chain on, 253-254
of aural microphonics, 318
at stimulation of auditory nerve,
343, 389 

Phase-change beat, 411-413
Phase distortion, 15
Phase theory, 168 
Phon, 111,454 
Piezoelectricity, 299-300, 454 
in cochlea, 310-355 
mechanism of arousal in hair-cells,
342

linearity in hair-cells, 348-351 
Pitch, 69-109, 453 
definition, 21, 70 
limits of, 69-70 
relation to intensity, 70-73, 349 
relation to frequency, 76-81 
of complex sounds, 98 
relation to duration, 100-105 
absolute, 107-109, 449 
effect of fatigue on, 219 
of frequency modulated tones, 239- 
241 

of beating tones, 243
mechanism underlying change with 
intensity, 349-350 
Pitch function, 80 
Pitches 

number of discriminable, 94, 369-
370 

relation to basilar membrane and 
organ of Corti, 369-370 





Place theory 

relation to pitch-contours, 75 
contrasted with resonance theory, 
359-360, 413 
Polarity 

of microphonics, 318 
of action potentials, 380-382, 389 
Polarization 
of cells, 296-298 

of Reissner's membrane, 334, 339-
340 
Potential 
cellular, 296-304 
injury, 297-298 
action (see Action potential) 
distortion (see Distortion potential) 
after, 302 
cortical, 302 

cochlear (see Microphonics) 
streaming, 333 
Potentiometer, 297, 301 
Power (see also Energy and Intensity) 
factor, 65 

generated in cochlea, 345 
Pressure 
alternating, 27 
radiation, 28 
minimum audible, 42-46
thermal-acoustic, 57 
Pressure-gradient in cochlea, 286, 443- 
445 

Pressure-level, 454 
Pressure-variations, 27 
at threshold, 27 
Pricking sensation in ear, 58-60 
Probability, relation to DL, 143-147
Projection 

of organ of Corti, 377-378
of medial geniculate on cortex, 418
of organ of Corti on medial geniculates,
432

Psychometric functions, 145 
Psychophysics, methods of, 50 
Psychophysiology, problem of, 306- 
307, 435-436 
Pulse wave in artery, 282
Push-pull amplifier, 192


Quality 

of musical instruments, 107 
of C-ness, 108 
effect of distortion, 202
effect of phase, 205-206 
(see also Complexity) 

Quantal processes 
in low frequency threshold, 45 
in intensity discrimination, 143-147
Quartz crystal, 299 

Radiations 
auditory, 417 

synchronization of impulses in,
426-427 

Radio-frequency, stimulation of ear 
by, 354-355 

Ratio of tympanic membrane to 
stapes, 256 
Rayleigh disk, 32, 33 
Receiver, 38-39
bone-conduction, 292
Rectification, electrical, in ear, 354-355
Reference frequency (tone), 111, 454
Reference intensity, 30, 110, 454
relation to threshold, 54
Reference pressure, 30, 110
Reflection 

of sound by walls, 16-17 
of sound in tubes, 18 
Reflex 

middle-ear, 253 
homolateral nature of, 264 
nature of, 265 

effect on transmission, 266-267
effect on aural microphonics, 322 
(see also Conditioned reflex) 
Refractory period, 301-309, 454
of auditory nerve, 401-402 
Reissner's membrane, 271-274
polarization of, 334, 339 
Resistance of nerve fiber, 408
Resonance, 8, 455
curve, 12
in ear-canal, 53
of ear, 261 

of basilar membrane, 278 





Resonance-theory, 12, 278, 359-360, 
413, 443 

Resonators, Helmholtz, 20 
Response, cochlear, 311-312
(see Microphonics)

Retrograde influence in nerves, 305-306
Reverberation time, 18 
relation to localization, 183 
Richness, tonal, 236-239 
Rods of Corti, 272-274
Root-mean-square value, 28
Rotation in nerve, 399-404 
Roughness 

measurement of, 245-247 
of beats, 242, 246 
effect of phase on, 205-206 

Saccule, 271, 415 
Salience 

due to gradients of stimulation, 
372-375 

of stimulation patterns, 216-217 
Scala (see Anatomy of inner ear)

Scale 

musical, 24 
of decibels, 31 
of pitch, 79-81 
sensory, 77 
intensive, 78 
numerical, 78-79 
of loudness, 112, 118 
Schema for auditory nerve, 402-404 
Search tube, 44 
Seebeck siren, 165
Selectivity, 13 

Semicircular canals (see Canals) 
Sensation 
divisibility of, 113 
attributes of, 160 

relation to nerve impulse, 306-307
differentiation necessary for, 435- 
436 

Sensation level, 110-111, 455
Sensation units, 61 
Sensations due to two tones, 210-212 
Sensitivity, differential 
to frequency, 84-89 


to intensity, 136-143 
variation of, 143-145
relation to organ of Corti, 369-370
Sensitivity of ear, 27, 42-68 
relation to intensity, 72 
Sensitization due to stimulation, 220 
Shift of minimum of equal-loudness 
contours, 72-73 
Shot effect, 58 

Side bands (see Modulation) 
Singleness of pitch, 235-239 
Slopes of microphonic functions, 320- 
321, 348-351 
Sone, 119,455 
Sound 

definition, 2, 455 
absorption, 18 
energy density, 455 
energy flux, 455 
pressure, 455 
Sound-cage, 175 
Sound intensity, 29, 455 
Sound shadow of head, 168-170 
Sound stage, 48 
Sound wave, equation, 5 
Spectrum, 455 
of DL for frequency, 89-94 
of short tones, 103-105 
of DL for intensity, 140-141
of sound impulses (clicks), 159,
282-284

complexity of, in ear, 199 
of modulated tones, 225-231 
of frequency modulated tones, 44 7 
Standing waves, 17, 32
Stapedius muscle, 250-251 
Stapes, 249-251 
action of, 256-259 
role in bone conduction, 293-295 
Steady state, 11
Stereophonic effect, 181-183 
Stiffness, analogy of, 34 
Stimulation 

pattern of basilar membrane, 216 
phase of, 343 

skewing of pattern on basilar membrane,
72-75, 219, 349-350





Stimulation (Continued) 
hearing by electrical, 65-67, 352- 
355 

of auditory nerve by electricity, 353 
theory of maximal, 372-375 
role of gradients of, 372-375 
of auditory nerve, 389-392 
binaural (effects in higher pathways),
428-429
Stimulus, to hearing, 1-3 
Stirrup (see Stapes) 

Stretch-effect, 299 
String galvanometer, 310 
Subharmonic, 455

Subjective tones (see Harmonics, 
aural, and Combination tones) 
Subordination, phenomenon of, 305
Successiveness, threshold of, 105-106 
Summation 
of loudness, 115 
at threshold (binaural), 52 
at synapse, 308 

of potentials from hair-cells, 347-348 
of action potentials, 407-409 
of action potentials from binaural 
stimulation, 428-429 
Summation tones, 198 
Super audible frequencies, 199-200 
Sweep circuit, 41
Sylvian fissure, 417 
Sympathin, 302, 392 
Synapses 

properties of, 307-309 
delay at, 424-427 
definition of, 455-456 
Synchronization of action potentials,
304, 456
in medulla, 310
in auditory nerve, 311, 393-404
in higher centers, 421, 426-427
at cortex, 433 

Synthesis of sound waves, 20 

Tactual sensations in ear, 46, 58-60
Tensor tympani, 250-251 
effect on pitch-contours, 73 
effect on aural harmonics, 191-193 
Testudo, 303 


Theories 

frequency, 359-360, 403 
place, 359-360, 413, 443-445 
resonance, 12, 278, 359-360, 413, 443
membrane (of nerve-conduction), 
297-302 

of maximal stimulation, 372-375
of gradients, 372-375 
of intensity-discrimination, 143-147, 
151 

of frequency-discrimination, 372- 
375 

Thermal noise, 57, 98, 456 
Thermophone, 43, 44 
Threshold, 42, 456
pressure, 42-46 
field, 47-50 
for fusion, 46 
for tonal sensation, 46 
statistical nature of, 49 
monaural vs binaural, 52, 430
relation to reference intensity, 54 
amplitude at, 55-56 
for feeling, 59
tactual, 59 
for electric shock, 67 
for successiveness, 105-106 
relation to loudness, 131-136 
for binaural localization, 170-171 
for distortion, 200-203 
effect of ossicles on, 252-254 
for bone-conduction, 292-293 
of cochlear or aural microphonic,
313-315 
in medulla, 315 
of internal hair-cells, 369 
of action potentials in auditory 
nerve, 383, 404-405 
effect of electrode position on, 406,
422-424

Tickle in ear, 58-60 
Timbre, 202 

(see also Complexity and Attributes)

Time, binaural difference of, 173 
Time-constant 
definition, 10 
of impulses, 155-159 





Time-error, 147 
Time, reverberation, 18 
Time-signals, 22 
Time theory, 168
Tinnitus, 351-352, 356 
Tonality, 108 
Tone 
pure, 2-8 
complex, 2, 6-7 
attributes of, 160-166
Tones, number of, 152-153, 369 
Tonometer, 21 

Trabeculae, role in bone-conduction,
295

Transducer, 38, 456 
cochlea as, 334, 352-355 
Transients, 11

in on-effect and off-effect, 329-331 
Transition between tones, effect on 
DL, 142
Transmission

characteristic of ear, 193-196 
factor of ear, 263 
Traveling wave (see Wave) 

Trapezoid body, 416 
action potentials in, 421-427 
Trophic influence in nerves, 304-305 
Tube 

infinite, 18 

elastic, dynamics of, 282, 443-445
vacuum, 1 

Tuberculum acusticum, 377
Tuning 

of ear (effect of intensity), 72-73 
of middle ear, 266-267 
of inner ear, 276-278, 338, 356-375, 
443-445

electrical, in cochlea, 352-353
Tunnel of organ of Corti, 271 

Uncertainty, principle of, 103 
Utilization time, 390, 456 
Utricle, 271

Vacuole, 298 


Valve (vacuum tube), 36 
Vas spirale, 271 
Velocity 
of sound, 7 
of particle, 7, 27, 454 
of nerve impulse, 308, 380-381 
of conduction through synapse, 308 
analogy of, 34 

Velocity microphone, 27 
Vestibule, 249-250, 268-269 
Vibration 

pattern of, due to tone, 284-286 
damped, 10 
Vibrato, 234-241, 456 
Volley theory, 403-404
Voltmeter, 37 
Volume 

tonal, 161-163, 456 
basis of, 164 
Vortices in cochlea, 286
Vulnerability of action potential, 381-
382 

Wallerian degeneration, 339 
Wave 
sound, 2-6 
standing, 17, 32 

traveling (on basilar membrane),
278-286

relation of traveling to microphonics,
331-332

velocity of traveling, 281-282
Waveform of microphonics, 316-317
Wave-length, 8, 32, 456 
Weber-Fechner law, 63
Weber fraction, 149
Wever-Bray effect, 394
Window 
oval, 249-251, 269 
round, 249-251, 269 
significance of, 254-255 
effect on bone-conduction, 293-295 

Yawning, effect on middle ear, 264 



